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Introduction 

In 1965 I first taught an undergraduate course in abstract algebra. It was fun to 
teach because the material was interesting and the class was outstanding. Five of 
those students later earned a Ph.D. in mathematics. Since then I have taught the 
course about a dozen times from various texts. Over the years I developed a set of 
lecture notes and in 1985 I had them typed so they could be used as a text. They 
now appear (in modified form) as the first five chapters of this book. Here were some 
of my motives at the time. 

1) To have something as short and inexpensive as possible. In my experience, 
students like short books. 

2) To avoid all innovation. To organize the material in the most simple-minded 

straightforward manner. 

3) To order the material linearly. To the extent possible, each section should use 
the previous sections and be used in the following sections. 

4) To omit as many topics as possible. This is a foundational course, not a topics 
course. If a topic is not used later, it should not be included. There are three 
good reasons for this. First, linear algebra has top priority. It is better to go 
forward and do more linear algebra than to stop and do more group and ring 
theory. Second, it is more important that students learn to organize and write 
proofs themselves than to cover more subject matter. Algebra is a perfect place 
to get started because there are many "easy" theorems to prove. There are 
many routine theorems stated here without proofs, and they may be considered 
as exercises for the students. Third, the material should be so fundamental 
that it be appropriate for students in the physical sciences and in computer 
science. Zillions of students take calculus and cookbook linear algebra, but few 
take abstract algebra courses. Something is wrong here, and one thing wrong 
is that the courses try to do too much group and ring theory and not enough 
matrix theory and linear algebra. 

5) To offer an alternative for computer science majors to the standard discrete 
mathematics courses. Most of the material in the first four chapters of this text 
is covered in various discrete mathematics courses. Computer science majors 
might benefit by seeing this material organized from a purely mathematical 
viewpoint. 
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Over the years I used the five chapters that were typed as a base for my algebra 
courses, supplementing them as I saw fit. In 1996 I wrote a sixth chapter, giving 
enough material for a full first year graduate course. This chapter was written in the 
same "style" as the previous chapters, i.e., everything was right down to the nub. It 
hung together pretty well except for the last two sections on determinants and dual 
spaces. These were independent topics stuck on at the end. In the academic year 
1997-98 I revised all six chapters and had them typed in LaTeX. This is the personal 
background of how this book came about. 

It is difficult to do anything in life without help from friends, and many of my 
friends have contributed to this text. My sincere gratitude goes especially to Marilyn 
Gonzalez, Lourdes Robles, Marta Alpar, John Zweibel, Dmitry Gokhman, Brian 
Coomes, Huseyin Kocak, and Shulim Kaliman. To these and all who contributed, 
this book is fondly dedicated. 

This book is a survey of abstract algebra with emphasis on linear algebra. It is 
intended for students in mathematics, computer science, and the physical sciences. 
The first three or four chapters can stand alone as a one semester course in abstract 
algebra. However they are structured to provide the background for the chapter on 
linear algebra. Chapter 2 is the most difficult part of the book because groups are 
written in additive and multiplicative notation, and the concept of coset is confusing 
at first. After Chapter 2 the book gets easier as you go along. Indeed, after the 
first four chapters, the linear algebra follows easily. Finishing the chapter on linear 
algebra gives a basic one year undergraduate course in abstract algebra. Chapter 6 
continues the material to complete a first year graduate course. Classes with little 
background can do the first three chapters in the first semester, and chapters 4 and 5 
in the second semester. More advanced classes can do four chapters the first semester 
and chapters 5 and 6 the second semester. As bare as the first four chapters are, you 
still have to truck right along to finish them in one semester. 

The presentation is compact and tightly organized, but still somewhat informal. 
The proofs of many of the elementary theorems are omitted. These proofs are to 
be provided by the professor in class or assigned as homework exercises. There is a 
non-trivial theorem stated without proof in Chapter 4, namely the determinant of the 
product is the product of the determinants. For the proper flow of the course, this 
theorem should be assumed there without proof. The proof is contained in Chapter 6. 
The Jordan form should not be considered part of Chapter 5. It is stated there only 
as a reference for undergraduate courses. Finally, Chapter 6 is not written primarily 
for reference, but as an additional chapter for more advanced courses. 



This text is written with the conviction that it is more effective to teach abstract 
and linear algebra as one coherent discipline rather than as two separate ones. Teach- 
ing abstract algebra and linear algebra as distinct courses results in a loss of synergy 
and a loss of momentum. Also with this text the professor does not extract the course 
from the text, but rather builds the course upon it. I am convinced it is easier to 
build a course from a base than to extract it from a big book. Because after you 
extract it, you still have to build it. The bare bones nature of this book adds to its 
flexibility, because you can build whatever course you want around it. Basic algebra 
is a subject of incredible elegance and utility, but it requires a lot of organization. 
This book is my attempt at that organization. Every effort has been extended to 
make the subject move rapidly and to make the flow from one topic to the next as 
seamless as possible. The student has limited time during the semester for serious 
study, and this time should be allocated with care. The professor picks which topics 
to assign for serious study and which ones to "wave arms at". The goal is to stay 
focused and go forward, because mathematics is learned in hindsight. I would have 
made the book shorter, but I did not have any more time. 

When using this text, the student already has the outline of the next lecture, and 
each assignment should include the study of the next few pages. Study forward, not 
just back. A few minutes of preparation does wonders to leverage classroom learning, 
and this book is intended to be used in that manner. The purpose of class is to 
learn, not to do transcription work. When students come to class cold and spend 
the period taking notes, they participate little and learn little. This leads to a dead 
class and also to the bad psychology of "OK, I am here, so teach me the subject." 
Mathematics is not taught, it is learned, and many students never learn how to learn. 
Professors should give more direction in that regard. 

Unfortunately mathematics is a difficult and heavy subject. The style and 
approach of this book is to make it a little lighter. This book works best when 
viewed lightly and read as a story. I hope the students and professors who try it, 
enjoy it. 

E. H. Connell 

Department of Mathematics 
University of Miami 
Coral Gables, FL 33124 
ec@math.miami.edu 
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Abstract algebra is not only a major subject of science, but it is also 
magic and fun. Abstract algebra is not all work and no play, and it is 
certainly not a dull boy See, for example, the neat card trick on page 
18. This trick is based, not on sleight of hand, but rather on a theorem 
in abstract algebra. Anyone can do it, but to understand it you need 
some group theory. And before beginning the course, you might first try 
your skills on the famous (some would say infamous) tile puzzle. In this 
puzzle, a frame has 12 spaces, the first 11 with numbered tiles and the 
last vacant. The last two tiles are out of order. Is it possible to slide the 
tiles around to get them all in order, and end again with the last space 
vacant? After giving up on this, you can study permutation groups and 
learn the answer! 



Chapter 1 

Background and Fundamentals of 
Mathematics 



This chapter is fundamental, not just for algebra, but for all fields related to mathe- 
matics. The basic concepts are products of sets, partial orderings, equivalence rela- 
tions, functions, and the integers. An equivalence relation on a set A is shown to be 
simply a partition of A into disjoint subsets. There is an emphasis on the concept 
of function, and the properties of surjective, injective, and bijective. The notion of a 
solution of an equation is central in mathematics, and most properties of functions 
can be stated in terms of solutions of equations. In elementary courses the section 
on the Hausdorff Maximality Principle should be ignored. The final section gives a 
proof of the unique factorization theorem for the integers. 

Notation Mathematics has its own universally accepted shorthand. The symbol 
3 means "there exists" and 3! means "there exists a unique". The symbol V means 
"for each" and =>■ means "implies" . Some sets (or collections) are so basic they have 
their own proprietary symbols. Five of these are listed below. 

N = Z + = the set of positive integers = {1, 2, 3, ...} 

Z = the ring of integers = {..., —2, — 1, 0, 1, 2, ...} 

Q = the field of rational numbers = {a/b : o, b G Z, b ^ 0} 

R = the field of real numbers 

C = the field of complex numbers = {a + bi : a,b G R} (i 2 = —1) 

Sets Suppose A,B,C,... are sets. We use the standard notation for intersection 
and union. 

An B = {x : x G A and x G B} = the set of all x which are elements 
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of A and B. 

A U B = {x : x G A or x G B} = the set of all x which are elements of 
A or B. 

Any set called an index set is assumed to be non-void. Suppose T is an index set and 
for each t G T , A t is a set. 



|J A t = {x : 3 t G T with a; G A t } 

f) A t = {x : if t E T,x E A t } = {x :Vt e T,x £ A t } 
teT 

Let be the null set. If A n 5 = 0, then A and U are said to be disjoint. 

Definition Suppose each of A and B is a set. The statement that A is a subset 
of B (Ac -B) means that if a is an element of A, then a is an element of B. That 
is, oei^aGB. If A C .B we may say A is contained in B, or i3 contains A. 

Exercise Suppose each of A and B is a set. The statement that A is not a subset 
of .B means 

Theorem (De Morgan's laws) Suppose 5* is a set. If C C S (i.e., if C is a subset 
of S), let C", the complement of C in 5*, be defined by C = S — C = {x E S : x $. C}. 
Then for any A, B C S, 

(A n B)' = A' U B' and 
(A U B)' = A'nB' 



Cartesian Products If X and Y are sets, X xY = {(x,y) : x E X and y G Y}. 
In other words, the Cartesian product of X and Y is defined to be the set of all 
ordered pairs whose first term is in X and whose second term is in Y . 

Example R x R = R 2 = the plane. 
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Definition If each of Xi, ...,X n is a set, Xi x • • • x X n = {(xi, ...,x n ) : x, G X, 

for 1 < i < n} = the set of all ordered n-tuples whose i-th term is in Xi. 

Example R x • • • x R = R n = real n-space. 

Question Is (R x R 2 ) = (R 2 x R) = R 3 ? 

Relations 



If A is a non-void set, a non-void subset R C A x A is called a relation on A. If 
(a, b) G -R we say that a is related to b, and we write this fact by the expression a ~ b. 
Here are several properties which a relation may possess. 

1) If a G A, then a ~ a. (reflexive) 

2) If a ~ 6, then 6 ~ a. (symmetric) 

2') If a ~ 6 and 6 ~ a, then a = b. (anti-symmetric) 

3) If a ~ 6 and 6 ~ c, then o ~ c. (transitive) 

Definition A relation which satisfies 1), 2'), and 3) is called a partial ordering. 
In this case we write a ~ b as a < 6. Then 

1) If a G A, then a < a. 

2') li a < b and b < a, then a = b. 

3) If o < 6 and b < c, then a < c. 

Definition A linear ordering is a partial ordering with the additional property 
that, if a, b G A, then a < b or b < a. 

Example ^4 = R with the ordinary ordering, is a linear ordering. 

Example A = all subsets of R 2 , with a < b defined by o C b, is a partial ordering. 



Hausdorff Maximality Principle (HMP) Suppose S is a non-void subset of A 
and ~ is a relation on A. This defines a relation on S. If the relation satisfies any 
of the properties 1), 2), 2'), or 3) on A, the relation also satisfies these properties 
when restricted to S. In particular, a partial ordering on A defines a partial ordering 
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on S. However the ordering may be linear on S but not linear on A. The HMP is 
that any linearly ordered subset of a partially ordered set is contained in a maximal 
linearly ordered subset. 

Exercise Define a relation on A = R 2 by (o, b) ~ (c, d) provided a < c and 
b < d. Show this is a partial ordering which is linear on S = {(a, a) : a < 0}. Find 
at least two maximal linearly ordered subsets of R 2 which contain S. 

One of the most useful applications of the HMP is to obtain maximal monotonic 
collections of subsets. 

Definition A collection of sets is said to be monotonic if, given any two sets of 
the collection, one is contained in the other. 

Corollary to HMP Suppose X is a non-void set and A is some non-void 
collection of subsets of X , and S is a subcollection of A which is monotonic. Then 3 
a maximal monotonic subcollection of A which contains S. 

Proof Define a partial ordering on A by V < W iff V C W, and apply HMP. 

The HMP is used twice in this book. First, to show that infinitely generated 
vector spaces have free bases, and second, in the Appendix, to show that rings have 
maximal ideals (see pages 87 and 109). In each of these applications, the maximal 
monotonic subcollection will have a maximal element. In elementary courses, these 
results may be assumed, and thus the HMP may be ignored. 



Equivalence Relations A relation satisfying properties 1), 2), and 3) is called 
an equivalence relation. 

Exercise Define a relation on A = Z by n ~ m iff n — m is a multiple of 3. 
Show this is an equivalence relation. 

Definition If ~ is an equivalence relation on A and a 6 A, we define the equiva- 
lence class containing a by cl(a) = {x e A : a ~ x}. 
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Theorem 

1) If b G cl(a) then cl(b) = cl(a). Thus we may speak of a subset of A 
being an equivalence class with no mention of any element contained 
in it. 

2) If each of U, V C A is an equivalence class and U fl V ^ 0, then 
f7 = V. 

3) Each element of A is an element of one and only one equivalence class. 

Definition A partition of A is a collection of disjoint non-void subsets whose union 
is A. In other words, a collection of non-void subsets of A is a partition of A provided 
any a G A is an element of one and only one subset of the collection. Note that if A 
has an equivalence relation, the equivalence classes form a partition of A. 

Theorem Suppose A is a non-void set with a partition. Define a relation on A by 
a ~ b iff a and b belong to the same subset of the partition. Then ~ is an equivalence 
relation, and the equivalence classes are just the subsets of the partition. 

Summary There are two ways of viewing an equivalence relation - - one is as a 
relation on A satisfying 1), 2), and 3), and the other is as a partition of A into 
disjoint subsets. 

Exercise Define an equivalence relation on Z by n ~ m iff n — m is a multiple 
of 3. What are the equivalence classes? 

Exercise Is there a relation on R satisfying 1), 2), 2') and 3) ? That is, is there 
an equivalence relation on R which is also a partial ordering? 

Exercise Let H C R 2 be the line H = {(a, 2a) : a G R}. Consider the collection 
of all translates of H, i.e., all lines in the plane with slope 2. Find the equivalence 
relation on R 2 defined by this partition of R 2 . 

Functions 



Just as there are two ways of viewing an equivalence relation, there are two ways 
of defining a function. One is the "intuitive" definition, and the other is the "graph" 
or "ordered pairs" definition. In either case, domain and range are inherent parts of 
the definition. We use the "intuitive" definition because everyone thinks that way. 
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Definition If X and Y are (non-void) sets, a function or mapping or map with 
domain X and range Y, is an ordered triple (X, Y, f) where / assigns to each x G X 
a well defined element /(#) G V. The statement that (X, Y, f) is a function is written 

as / : X -»■ Y or X -£ Y. 

Definition The graph of a function (X, Y, f) is the subset r C X x Y defined 
by T = {(x,f(x)) : x G X}. The connection between the "intuitive" and "graph" 
viewpoints is given in the next theorem. 

Theorem If / : X — ► Y, then the graph r C X x Y has the property that each 
x G X is the first term of one and only one ordered pair in I\ Conversely, if T is a 
subset of X x Y with the property that each xGXis the first term of one and only 
ordered pair in T, then 3! / : X — ► Y whose graph is I\ The function is defined by 
"f(x) is the second term of the ordered pair in F whose first term is x." 



Example Identity functions Here X = Y and / : X — > X is defined by 
f(x) = x for all x G X. The identity on X is denoted by Ix or just / : X — >• X. 

Example Constant functions Suppose y £ Y. Define / : X — ► Y by /(x) = 
y for all x G X. 

Restriction Given / : X — ► Y and a non-void subset S of X, define / | S : 5 — ► Y" 
by (/ | 5)(s) = /(s) for all sG 5. 

Inclusion If 5 is a no n- void subset of X, define the inclusion i : 5 — ► X by 

i(s) = s for all s E S. Note that inclusion is a restriction of the identity. 

Composition Given W — ► X — ► Y" define g o f : W ^ Y by 

Theorem (The associative law of composition) If V —> W —> X —> Y , then 
ho (g o f) = (h o g) o f. This may be written as ho g o f. 
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Definitions Suppose / : X — ► F. 

1) If T c Y , the inverse image of T is & subset of X , f~ l (T) = {x E X : 
f(x) G T}. 

2) If 5 C X, the image of S is a subset of F, /(>S) = {/(s) : s G S} = 
{yeY:3seS with /(s) = y}. 

3) The image of f is the image of X , i.e., image (/) = f(X) = 
{/(x) : a; G X} = {u G F : 3x G X with /(x) = 2/}. 

4) / : X — ► Y is surjective or onto provided image (/) = F i.e., the image 
is the range, i.e., if y G Y, f~ l (y) is a non-void subset of X. 

5) / : X —y Y is infective or 1-1 provided (xi ^ X2) =>■ /(^i) 7^ f(x2), i.e., 
if X! and x 2 are distinct elements of X, then f(x\) and /(x 2 ) are 
distinct elements of Y. 

6) / : X — > F is bijective or is a 1-1 correspondence provided / is surjective 
and injective. In this case, there is function / _1 : F — ► X with / _1 o / = 
I X :X^X and / o f~ l = I Y : F ->■ F. Note that J" 1 : F ->■ X is 
also bijective and (/ _1 ) _1 = /. 

Examples 

1) / : R —y R defined by f(x) = sin(x) is neither surjective nor injective. 

2) / : R —y [—1, 1] defined by f(x) = sin(x) is surjective but not injective. 

3) / : [0,7r/2] —y R defined by f(x) = sin(x) is injective but not surjective. 

4) / : [0,7r/2] —y [0, 1] defined by /(x) = sin(x) is bijective. (f~ 1 (x) is 
written as arcsin(x) or sin _1 (x).) 

5) / : R — ► (0, 00) defined by /(x) = e x is bijective. (/ _1 (x) is written as 
ln(x).) 

Note There is no such thing as "the function sin(x)." A function is not defined 
unless the domain and range are specified. 
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Exercise Show there are natural bijections from (R x R 2 ) to (R 2 x R) and 
from (R 2 x R) to R x R x R. These three sets are disjoint, but the bijections 
between them are so natural that we sometimes identify them. 

Exercise Suppose X is a set with 6 elements and Y is a finite set with n elements. 

1) There exists an injective / : X — ► Y iff n 

2) There exists a surjective / : X — > Y iff n 

3) There exists a bijective / : X — > Y iff n 



Pigeonhole Principle Suppose X is a finite set with m elements, Y is a finite 
set with n elements, and / : X — ► Y is a function. 

1) If m = n, then / is injective iff / is surjective iff / is bijective. 

2) If m > n, then / is not injective. 

3) If m < n, then / is not surjective. 

If you are placing 6 pigeons in 6 holes, and you run out of pigeons before you fill 
the holes, then you have placed 2 pigeons in one hole. In other words, in part 1) for 
m = n = 6, if / is not surjective then / is not injective. Of course, the pigeonhole 
principle does not hold for infinite sets, as can be seen by the following exercise. 

Exercise Show there is a function / : Z + — ► Z + which is injective but not 
surjective. Also show there is one which is surjective but not injective. 

Exercise Suppose / : [-2,2] -> R is defined by f(x) = x 2 . Find / _1 (/([1, 2])). 
Also find fif-'dS, 5])). 

Exercise Suppose / : X — ► Y is a function, S C X and T C Y. Find the 
relationship between S and /~ 1 (/(S')). Show that if / is injective, S = /~ 1 (/(5')). 
Also find the relationship between T and /(/ _1 (T)). Show that if / is surjective, 
T = /(/- 1 (T)). 



Strips We now define the vertical and horizontal strips of X x Y. 

If x G X, {{xo,y) : y G Y} = (x x Y) is called a vertical strip. 
If yo G Y, {(x,yo) : x G X} = (X x y ) is called a horizontal strip. 
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Theorem Suppose S C X x Y . The subset S is the graph of a function with 
domain X and range Y iff each vertical strip intersects S in exactly one point. 

This is just a restatement of the property of a graph of a function. The purpose 
of the next theorem is to restate properties of functions in terms of horizontal strips. 



Theorem Suppose / : X — ► Y has graph I\ Then 

1) Each horizontal strip intersects T in at least one point iff / is . 

2) Each horizontal strip intersects T in at most one point iff / is 

3) Each horizontal strip intersects T in exactly one point iff / is 



Solutions of Equations Now we restate these properties in terms of solutions of 
equations. Suppose / : X — > Y and yo G Y. Consider the equation f{x) = yo- Here 
yo is given and x is considered to be a "variable" . A solution to this equation is any 
xq E X with f(xo) = yo. Note that the set of all solutions to f(x) = yo is f~ 1 (yo)- 
Also f(x) = yo has a solution iff y G image(f) iff f~ 1 (yo) is non-void. 

Theorem Suppose / : X — ► Y. 

1) The equation /(#) = yo has at least one solution for each yo G Y iff 

/ is 

2) The equation f(x) = yo has at most one solution for each y G Y iff 

/ is 

3) The equation f(x) = y Q has a unique solution for each y Q G Y iff 

/ is 



Right and Left Inverses One way to understand functions is to study right and 
left inverses, which are defined after the next theorem. 

Theorem Suppose X — > Y — ► IT are functions. 
1) If g o / is injective, then / is injective. 
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2) If g o f is surjective, then g is surjective. 

3) If g o f is bijective, then / is injective and g is surjective. 

Example X = W = {p}, Y = {p, q}, f(p) = p, and g(p) = g(q) = p. Here 
g o f is the identity, but / is not surjective and g is not injective. 

Definition Suppose / : X — ► Y is a function. A left inverse of / is a function 
g : Y — > X such that gof = I x :X^X. A right inverse of / is a function 
/i : F ^ X such that foh = I Y :Y^Y. 

Theorem Suppose / : X — > F is a function. 

1) / has a right inverse iff / is surjective. Any such right inverse must be 
injective. 

2) / has a left inverse iff / is injective. Any such left inverse must be 
surjective. 

Corollary Suppose each of X and Y is a non-void set. Then 3 an injective 
/ : X —y Y iff 3 a surjective g : Y — ► A. Also a function from A to Y is bijective 
iff it has a left inverse and a right inverse iff it has a left and right inverse. 

Note The Axiom of Choice is not discussed in this book. However, if you worked 
1) of the theorem above, you unknowingly used one version of it. For completeness, 
we state this part of 1) again. 

The Axiom of Choice If / : X — ► Y is surjective, then / has a right inverse 
h. That is, for each y £ Y, it is possible to choose an x G f~ l (y) and thus to define 
h(y) =x. 

Note It is a classical theorem in set theory that the Axiom of Choice and the 
Hausdorff Maximality Principle are equivalent. However in this text we do not go 
that deeply into set theory. For our purposes it is assumed that the Axiom of Choice 
and the HMP are true. 

Exercise Suppose / : X — ► Y is a function. Define a relation on X by a ~ b if 
f(a) = f(b). Show this is an equivalence relation. If y belongs to the image of /, 
then f~ x {y) is an equivalence class and every equivalence class is of this form. In the 
next chapter where / is a group homomorphism, these equivalence classes will be 
called cosets. 



Chapter 1 Background 



11 



Projections 

7Ti : Xi x X 2 - 



If Xi and X2 are non-void sets, we define the projection maps 
X\ and 7T2 : X\ x X2 — ► X2 by 7Tj(xi,X2) = £*■ 



Theorem If y, Xi, and X2 are no n- void sets, there is a 1-1 correspondence 
between {functions /: Y —>■ X 1 x X 2 } and {ordered pairs of functions (/1, / 2 ) where 
/1: y - X! and f 2 : y - X 2 }. 

Proof Given /, define fi = n 1 o f and f 2 = n 2 o /. Given f 1 and / 2 define 
/ : y -»■ X 1 x X 2 by /(y) = (fi(y),f 2 (y))- Thus a function from y to X x x X 2 is 
merely a pair of functions from Y to X\ and y to X2. This concept is displayed in 
the diagram below. It is summarized by the equation / = (/ 1; f 2 ). 



Y 





X ± ■ 7Fl X 1 x X 2 n2 • X 2 



One nice thing about this concept is that it works fine for infinite Cartesian 
products. 

Definition Suppose T is an index set and for each t G T, X t is a non-void set. 

Then the product J\X t = Y\X t is the collection of all sequences {x t }t^T = {xt} 

teT 
where x t G X t . Formally these sequences are functions a from T to \JX t with each 

a(t) in X t and written as a(t) = x t - If T = {1,2, . . . ,n} then {x t } is the ordered 

n-tuple (xi,x 2 ,. . . ,x n ). If T = Z + then {x t } is the sequence (xi,x 2 ,- ■ •)• For any T 

and any s in T, the projection map 7r s : Y\X t ^ X s is defined by ir s ({x t }) = x s . 



Theorem If Y is any no n- void set, there is a 1-1 correspondence between 

{functions / : Y — > El^t} an d {sequences of functions {/t}t e T where f t :Y—y X t }. 
Given /, the sequence {f t } is defined by ft = ir t o f. Given {ft}, f is defined by 
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A Calculus Exercise Let A be the collection of all functions / : [0, 1] — ► R 
which have an infinite number of derivatives. Let Aq C A be the subcollection of 
those functions / with /(0) = 0. Define D : Aq — ► ^4 by D(/) = df/dx. Use the mean 
value theorem to show that D is injective. Use the fundamental theorem of calculus 
to show that D is surjective. 

Exercise This exercise is not used elsewhere in this text and may be omitted. It 
is included here for students who wish to do a little more set theory. Suppose T is a 
non-void set. 

1) If Y is a non-void set, define Y T to be the collection of all functions with domain 
T and range Y. Show that if T and Y are finite sets with m and n elements, then 
Y T has n m elements. In particular, when T = {1,2,3}, Y T = Y x Y x Y has 
n 3 elements. Show that if n > 3, the subset of yl 1 ' 2 ' 3 } of all injective functions has 
n(n — l)(n — 2) elements. These injective functions are called permutations on Y 
taken 3 at a time. If T = N, then Y T is the infinite product Y x Y x • • • . That is, 
Y N is the set of all infinite sequences (2/1,2/2, ■ ■ •) where each yi G Y. For any Y and 
T, let Y t be a copy of Y for each t G T. Then Y T = JJ Y t . 

teT 

2) Suppose each of Y\ and Y2 is a non-void set. Show there is a natural bijection 

from (Yi x Y2) to Kf x Y 2 T . (This is the fundamental property of Cartesian products 
presented in the two previous theorems.) 

3) Define V(T), the power set of T, to be the collection of all subsets of T (including 
the null set). Show that if T is a finite set with m elements, V(T) has 2 m elements. 

4) If S is any subset of T, define its characteristic function x s '■ T — ► {0,1} by 
letting % s (t) be 1 when t G S, and be when t £ 5. Define a : P(T) — > {0, 1} T by 
a(5) = x s - Define /3 : {0, 1} T -> P(T) by (3(f) = /- x (l). Show that if S C T then 
/3 o a(5) = 5, and if / : T — ► {0, 1} then a o /?(/) = /. Thus a is a bijection and 
/5 = a" 1 . 

V(T)^{0,1} T 

5) Suppose 7 : T — ► {0, 1} T is a function and show that it cannot be surjective. If 
t G T, denote 7 (t) by 7 (t) = / t : T -> {0, 1}. Define / : T ->■ {0, 1} by /(t) = if 
/i(t) = 1, and f(t) = 1 if f t (t) = 0. Show that / is not in the image of 7 and thus 
7 cannot be surjective. This shows that if T is an infinite set, then the set {0, 1} T 
represents a "higher order of infinity than T" . 

6) An infinite set Y is said to be countable if there is a bijection from the positive 
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integers N to Y. Show Q is countable but the following three collections are not. 

i) V(N), the collection of all subsets of N. 

ii) {0, 1} N , the collection of all functions / : N -► {0, 1}. 

hi) The collection of all sequences (yi,V2, ■ ■ •) where each y^ is or 1. 

We know that ii) and hi) are equal and there is a natural bijection between i) 
and ii). We also know there is no surjective map from N to {0, 1} N , i.e., {0, 1} N is 
uncountable. Finally, show there is a bijection from {0, 1} N to the real numbers R. 
(This is not so easy. To start with, you have to decide what the real numbers are.) 

Notation for the Logic of Mathematics 



Each of the words "Lemma" , "Theorem" , and "Corollary" means "true state- 
ment" . Suppose A and B are statements. A theorem may be stated in any of the 
following ways: 

Theorem Hypothesis Statement A. 
Conclusion Statement B. 

Theorem Suppose A is true. Then B is true. 
Theorem If A is true, then B is true. 
Theorem A ^> B {A implies B ). 

There are two ways to prove the theorem — to suppose A is true and show B is 
true, or to suppose B is false and show A is false. The expressions U A <£4> B" , U A is 
equivalent to B" , and "A is true iff B is true " have the same meaning (namely, that 
A => B and B =» A). 

The important thing to remember is that thoughts and expressions flow through 
the language. Mathematical symbols are shorthand for phrases and sentences in the 
English language. For example, "x G B " means "x is an element of the set B." If A 
is the statement "x G Z + " and B is the statement u x 2 G Z + ", then "A =4> B" means 
"If x is a positive integer, then x 2 is a positive integer" . 



Mathematical Induction is based upon the fact that if S C Z + is a non-void 
subset, then S contains a smallest element. 
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Theorem Suppose P(n) is a statement for each n = 1,2,... . Suppose P(l) is 
true and for each n > 1, P(n) =r- P(n + 1). Then for each n > 1, P(n) is true. 

Proof If the theorem is false, then 3 a smallest positive integer m such that 

P(m) is false. Since P(m — 1) is true, this is impossible. 

Exercise Use induction to show that, for each n > 1, 1 + 2 H \-n = n(n+l)/2. 

The Integers 



In this section, lower case letters a,b, c, ... will represent integers, i.e., elements 
of Z. Here we will establish the following three basic properties of the integers. 

1) If G is a subgroup of Z, then 3 n > such that G = riL. 

2) If a and b are integers, not both zero, and G is the collection of all linear 
combinations of a and b, then G is a subgroup of Z, and its 

positive generator is the greatest common divisor of a and b. 

3) If n > 2, then n factors uniquely as the product of primes. 

All of this will follow from long division, which we now state formally. 

Euclidean Algorithm Given a,b with b ^ 0, 3! m and r with < r <|6| and 
a = bra + r. In other words, b divides a "m times with a remainder of r" . For 
example, if a = —17 and 6 = 5, then m = —4 and r = 3, —17 = 5(— 4) + 3. 

Definition If r = 0, we say that b divides a or a is a multiple of b. This fact is 
written as b \ a. Note that b \ a 4=> the rational number a/b is an integer 44> 3! m 
such that a = bm O- a G bZ. 



Note Anything (except 0) divides 0. does not divide anything. 

± 1 divides anything . If n ^ 0, the set of integers which n divides 
is vJL = {nm : m G Z} = {..., — 2n, —n, 0, n, 2n, ...}. Also n divides 
a and b with the same remainder iff n divides (a — b). 



Definition A non-void subset G C Z is a subgroup provided (g G G =>■ — g G G) 
and (gi,g2 E G => (g± +52) G G). We say that G is closed under negation and closed 
under addition. 
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Theorem If n G Z then riL is a subgroup. Thus if n ^ 0, the set of integers 
which n divides is a subgroup of Z. 

The next theorem states that every subgroup of Z is of this form. 

Theorem Suppose G C Z is a subgroup. Then 

1) OgG. 

2) If (/! and #2 £ G, then (migi + m 2 g2) & G for all integers va^va-i- 

3) 3! non-negative integer n such that G = nZ. In fact, if G^{0} 
and n is the smallest positive integer in G, then G = riL. 

Proof Since G is non-void, 3 g G G. Now (— g) G G and thus = g + (— #) 
belongs to G, and so 1) is true. Part 2) is straightforward, so consider 3). If G ^ 0, 
it must contain a positive element. Let n be the smallest positive integer in G. If 
g G G, g = nm + r where < r < n. Since r G G, it must be 0, and g G nZ. 



Now suppose a, 6 G Z and at least one of a and 6 is non-zero. 

Theorem Let G be the set of all linear combinations of a and b, i.e., G = 
{ma + nb : m, n G Z}. Then 

1) G contains o and 6. 

2) G is a subgroup. In fact, it is the smallest subgroup containing a and 6. 
It is called the subgroup generated by a and 6. 

3) Denote by (a, 6) the smallest positive integer in G. By the previous 
theorem, G = (a,6)Z, and thus (o, 6) | a and (a,b) | 6. Also note that 
3 m, n such that ma + nb = (a, 6). The integer (a, 6) is called 

the greatest common divisor of a and 6. 

4) If n is an integer which divides a and 6, then n also divides (a, 6). 

Proof of 4) Suppose n \ a and n \ b i.e., suppose a, o G nZ. Since G is the 
smallest subgroup containing a and 6, nZ Z> (o, 6)Z, and thus n \ (a, 6). 

Corollary The following are equivalent. 

1) a and b have no common divisors, i.e., (n | a and n | 6) =^ n = ±1. 
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2) (a, b) = 1, i.e., the subgroup generated by a and b is all of Z. 

3) 3 772, n eZ with ma + nb = 1. 

Definition If any one of these three conditions is satisfied, we say that a and b 
are relatively prime. 



This next theorem is the basis for unique factorization. 

Theorem If a and b are relatively prime with a not zero, then a\bc =>- a\c. 

Proof Suppose o and b are relatively prime, c G Z and o | be. Then there exist 
m,n with ma + nb = 1, and thus mac + nac = c. Now a | mac and a \ nbc. Thus 
a | (mac + nfrc) and so a \ c. 

Definition A prime is an integer p > 1 which does not factor, i.e., if p = ab then 
a = ±1 or a = ±p. The first few primes are 2, 3, 5, 7, 11, 13, 17,... . 

Theorem Suppose p is a prime. 

1) If a is an integer which is not a multiple of p, then (p, a) = 1. In other 
words, if a is any integer, (p, a) = p or (p, a) = 1. 

2) If p | a& then p \ a or p | 6. 

3) If p | aia2 • • • a n then p divides some a^. Thus if each a^ is a prime, 
then p is equal to some Oj. 

Proof Part 1) follows immediately from the definition of prime. Now suppose 
p | ab. If p does not divide a, then by 1), (p, a) = 1 and by the previous theorem, p 
must divide 6. Thus 2) is true. Part 3) follows from 2) and induction on n. 



The Unique Factorization Theorem Suppose a is an integer which is not 0,1, 
or -1. Then a may be factored into the product of primes and, except for order, this 
factorization is unique. That is, 3 a unique collection of distinct primes Pi,P2, ■■■,Pk 
and positive integers s±, S2, ■■■, s^ such that a = zizp^p^ 2 ■ ■ ■ p s k k ■ 

Proof Factorization into primes is obvious, and uniqueness follows from 3) in the 
theorem above. The power of this theorem is uniqueness, not existence. 
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Now that we have unique factorization and part 3) above, the picture becomes 
transparent. Here are some of the basic properties of the integers in this light. 

Theorem (Summary) 

1) Suppose |o|> 1 has prime factorization a = ibpi 1 • • • p s k . Then the only 
divisors of a are of the form zbp^ 1 • • • p k where < ti < Si for i = 1, ..., k. 

2) If | a |> 1 and | b |> 1, then (a, b) = 1 iff there is no common prime in 
their factorizations. Thus if there is no common prime in their 
factorizations, 3 m, n with ma + nb = 1, and also (a 2 , b 2 ) = 1. 

3) Suppose | a\> 1 and |fe|> 1. Let {pi, ... ,p k } be the union of the distinct 
primes of their factorizations. Thus o = ipi 1 • • • p s k k where < Sj and 

b = ztp'i ■ ■ ■ p k k where < tj. Let -U; be the minimum of Si and £«. Then 
(o, b) = pT ■ ■ ■ p\ k . For example (2 3 • 5 • 11, 2 2 • 5 4 • 7) = 2 2 • 5. 

3') Let Vi be the maximum of Si and tj. Then c = p"i ■ ■ ■ p v k k is the least 
(positive) common multiple of a and b. Note that c is a multiple of 
a and 6, and if n is a multiple of a and b, then n is a multiple of c. 
Finally, if a and b are positive, their least common multiple is 
c = ab/(a, b), and if in addition a and b are relatively prime, 
then their least common multiple is just their product. 

4) There is an infinite number of primes. (Proof: Suppose there were only 
a finite number of primes Pi,P2, ■•-,Pk- Then no prime would divide 

(PlP2- ■ -Pfc + 1)-) 

5) Suppose c is an integer greater than 1. Then \fc is rational iff \fc is an 
integer. In particular, v2 and \/3 are irrational. (Proof: If \J~c~ is 
rational, 3 positive integers a and b with \fc = a/b and (a, b) = 1. 
lib > 1, then it is divisible by some prime, and since cb 2 = a 2 , this 
prime will also appear in the prime factorization of a. This is a 
contradiction and thus b = 1 and yfc is an integer.) (See the fifth 
exercise below.) 

Exercise Find (180,28), i.e., find the greatest common divisor of 180 and 28, 
i.e., find the positive generator of the subgroup generated by {180,28}. Find integers 
m and n such that 180m + 28n = (180, 28). Find the least common multiple of 180 
and 28, and show that it is equal to (180 • 28)/(180, 28). 
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Exercise We have defined the greatest common divisor (gcd) and the least com- 
mon multiple (1cm) of a pair of integers. Now suppose n > 2 and S = {oi, a 2 , ■-, a n } 
is a finite collection of integers with |oj| > 1 for 1 < i < n. Define the gcd and the 
lcm of the elements of S and develop their properties. Express the gcd and the 1cm 
in terms of the prime factorizations of the Oj. When is the lcm of S equal to the 
product (11(12 ■ ■ ■ a n ? Show that the set of all linear combinations of the elements of 
S is a subgroup of Z, and its positive generator is the gcd of the elements of S. 

Exercise Show that the gcd of S = {90,70,42} is 2, and find integers n l7 n 2 ,n3 
such that 90ni + 70n 2 + 42n 3 = 2. Also find the lcm of the elements of S. 

Exercise Show that if each of Gi, G 2 , ■■■, G m is a subgroup of Z, then 

Gi n G 2 n • • • n G m is also a subgroup of Z. Now let G = (90Z) n (70Z) n (42Z) 
and find the positive integer n with G = nZ. 

Exercise Show that if the nth root of an integer is a rational number, then it 
itself is an integer. That is, suppose c and n are integers greater than 1. There is a 
unique positive real number x with x n = c. Show that if x is rational, then it is an 
integer. Thus if p is a prime, its nth root is an irrational number. 

Exercise Show that a positive integer is divisible by 3 iff the sum of its digits is 
divisible by 3. More generally, let a = a„a n _i . . . 00 = a n 10 n + a n _ilO n_1 + • • • + 00 
where < Oj < 9. Now let b = a n + a n _i + • • • + Oo, and show that 3 divides a and b 
with the same remainder. Although this is a straightforward exercise in long division, 
it will be more transparent later on. In the language of the next chapter, it says that 
[a] = [b] in Z 3 . 

Card Trick Ask friends to pick out seven cards from a deck and then to select one 
to look at without showing it to you. Take the six cards face down in your left hand 
and the selected card in your right hand, and announce you will place the selected 
card in with the other six, but they are not to know where. Put your hands behind 
your back and place the selected card on top, and bring the seven cards in front in 
your left hand. Ask your friends to give you a number between one and seven (not 
allowing one). Suppose they say three. You move the top card to the bottom, then 
the second card to the bottom, and then you turn over the third card, leaving it face 
up on top. Then repeat the process, moving the top two cards to the bottom and 
turning the third card face up on top. Continue until there is only one card face 
down, and this will be the selected card. Magic? Stay tuned for Chapter 2, where it 
is shown that any non-zero element of Z 7 has order 7. 



Chapter 2 

Groups 



Groups are the central objects of algebra. In later chapters we will define rings and 
modules and see that they are special cases of groups. Also ring homomorphisms and 
module homomorphisms are special cases of group homomorphisms. Even though 
the definition of group is simple, it leads to a rich and amazing theory. Everything 
presented here is standard, except that the product of groups is given in the additive 
notation. This is the notation used in later chapters for the products of rings and 
modules. This chapter and the next two chapters are restricted to the most basic 
topics. The approach is to do quickly the fundamentals of groups, rings, and matrices, 
and to push forward to the chapter on linear algebra. This chapter is, by far and 
above, the most difficult chapter in the book, because group operations may be written 
as addition or multiplication, and also the concept of coset is confusing at first. 

Definition Suppose G is a non-void set and (/) : G x G —> G is a. function, (f) is 
called a binary operation, and we will write <f>(a, b) = a-b or 0(a, b) = a + b. Consider 
the following properties. 

1) If a, b, c £ G then a ■ (b ■ c) = (a ■ b) ■ c. If a,b,c £ G then a + (b + c) = (a + b) + c. 

2) 3 e = ec £ G such that if a G G 3 0=0^ £ G such that if a G G 

e ■ a = a ■ e = a. 0+a = a+0= a. 

3) If a G G, 3b £ G with a-b = b- a = e If a £ G,3b e G with a + b = b + a = 

(b is written as b = a -1 ). (6 is written as b = —a). 

4) If a, b G G, then a ■ b = b ■ a. If a,b E G, then a + b = b + a. 

Definition If properties 1), 2), and 3) hold, (G,(f>) is said to be a group. If we 
write (f>(a, b) = a ■ b, we say it is a multiplicative group. If we write (f)(a, b) = a + b, 
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we say it is an additive group. If in addition, property 4) holds, we say the group is 
abelian or commutative. 

Theorem Let (G, 4>) be a multiplicative group. 

(i) Suppose a,c,c G G. Then a ■ c = a ■ c =>■ c = c. 

Also c • a = c • a =>■ c = c. 
In other words, if / : G — ► G is denned by /(c) = a • c, then / is injective. 
Also / is bijective with f _1 given by / _1 (c) = a -1 • c. 

(ii) e is unique, i.e., if e G G satisfies 2), then e = e. In fact, 

if a, 6 G G then (a • 6 = a) =>■ (6 = e) and (a • 6 = b) => (a = e). 
Recall that b is an identity in G provided it is a right and left 
identity for any o in G. However, group structure is so rigid that if 
3 a G G such that b is a right identity for a, then b = e. 
Of course, this is just a special case of the cancellation law in (i). 

(iii) Every right inverse is an inverse, i.e., if a ■ b = e then b = a -1 . 
Also if b ■ a = e then b = a~ l . Thus inverses are unique. 

(iv) If a G G, then (a -1 )" 1 = a. 

(v) The multiplication 01-02-03 = a±- (02 • 03) = (ai • 02) • 03 is well-defined. 

In general, a± ■ a 2 ■ ■ ■ a n is well defined. 

(vi) If o, b G G, (o • 6) _1 = 6 _1 • a -1 . Also (ai • a 2 • • • o„) _1 = 
a n ' a n-l ' ' ' a i ■ 

(vii) Suppose a E G. Let a = e and if n > 0, a n = a • • • a (n times) 
and a _n = a -1 • • • a -1 (n times). If ni, n 2 , ..., n t G Z then 

ni . a n 2 . . . a n t = a n 1+ -+n t _ AlsQ ( n)m = fl nm_ 

Finally, if G is abelian and a,b E G, then (a • 6) ra = a n ■ b n . 

Exercise. Write out the above theorem where G is an additive group. Note that 
part (vii) states that G has a scalar multiplication over Z. This means that if a is in 
G and n is an integer, there is defined an element an in G. This is so basic, that we 
state it explicitly. 

Theorem. Suppose G is an additive group. If a G G, let aO =0 and if n > 0, 

let an = (o + • • +a) where the sum is n times, and a(—n) = (—a) + (—a) ■ ■ + (—a), 
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which we write as (—a — a ■ ■ — a). Then the following properties hold in general, 
except the first requires that G be abelian. 



(a + b)n = 


an + bn 


a(n + m) = 


an + am 


a(nm) = 


(an)m 


al = 


a 



Note that the plus sign is used ambiguously - - sometimes for addition in G 
and sometimes for addition in Z. In the language used in Chapter 5, this theorem 
states that any additive abelian group is a Z-module. (See page 71.) 

Exercise Suppose G is a non-void set with a binary operation (p(a, b) = a-b which 
satisfies 1), 2) and [ 3') If a G G, 3b G G with a ■ b = e\. Show (G, (f>) is a group, 
i.e., show b ■ a = e. In other words, the group axioms are stronger than necessary. 
If every element has a right inverse, then every element has a two sided inverse. 

Exercise Suppose G is the set of all functions from Z to Z with multiplication 
defined by composition, i.e., / • g = f o g. Note that G satisfies 1) and 2) but not 3), 
and thus G is not a group. Show that / has a right inverse in G iff / is surjective, 
and / has a left inverse in G iff / is injective (see page 10). Also show that the set 
of all bijections from Z to Z is a group under composition. 

Examples G = R, G = Q, or G = Z with 0(o, b) = a + b is an additive 
abelian group. 

Examples G = R — or G = Q — with 0(a, b) = ab is a multiplicative 
abelian group. 

G = Z — with (f)(a, b) = ab is not a group. 
G = R + = {r G R : r > 0} with 0(o, b) = ab is a multiplicative 
abelian group. 



Subgroups 



Theorem 

satisfying 



Suppose G is a multiplicative group and H C G is a non-void subset 



and 



1) if a, b G H then a-b G H 

2) if a G H then a" 1 G H. 
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Then e £ H and H is a group under multiplication. H is called a subgroup of G. 

Proof Since iJ is non-void, 3a G iZ. By 2), a -1 G i? and so by 1), e E H . The 
associative law is immediate and so H is a group. 

Example G is a subgroup of G and e is a subgroup of G. These are called the 
improper subgroups of G. 

Example If G = Z under addition, and n G Z, then iJ = nZ is a subgroup of 
Z. By a theorem in the section on the integers in Chapter 1, every subgroup of Z 
is of this form (see page 15). This is a key property of the integers. 



Exercises Suppose G is a multiplicative group. 

1) Let H be the center of G, i.e., H = {h G G : g ■ h = h ■ g for all g G G}. Show 
H is a subgroup of G. 

2) Suppose Hi and i^2 are subgroups of G. Show i^ n H 2 is a subgroup of G. 

3) Suppose i/i and H 2 are subgroups of G, with neither Hi nor i^2 contained in 
the other. Show Hi U H 2 is not a subgroup of G. 

4) Suppose T is an index set and for each t G T, iJ t is a subgroup of G. 
Show P| H t is a subgroup of G. 

5) Furthermore, if {i^} is a monotonic collection, then \\H t is a subgroup of G. 

tgT 

6) Suppose G= {all functions / : [0, 1] — > R}. Define an addition on G by 

(/ + g)(t) = f(t) + g(i) for all t G [0, 1]. This makes G into an abelian group. 
Let K be the subset of G composed of all different iable functions. Let H 
be the subset of G composed of all continuous functions. What theorems 
in calculus show that H and K are subgroups of G? What theorem shows 
that K is a subset (and thus subgroup) of HI 



Order Suppose G is a multiplicative group. If G has an infinite number of 
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elements, we say that o(G), the order of G, is infinite. If G has n elements, then 
o(G) = n. Suppose a G G and H = {a 1 : i G Z}. i7 is an abelian subgroup of G 
called the subgroup generated by a. We define the order of the element a to be the 
order of H, i.e., the order of the subgroup generated by a. Let / : Z — ► H be the 
surjective function defined by f(m) = a m . Note that f(k + /) = /(&) • f(l) where 
the addition is in Z and the multiplication is in the group H. We come now to the 
first real theorem in group theory. It says that the element a has finite order iff / 
is not injective, and in this case, the order of a is the smallest positive integer n 
with a n = e. 

Theorem Suppose a is an element of a multiplicative group G, and 

H = {a 1 : i £ Z}. If 3 distinct integers i and j with a 1 = a- 7 , then a has some finite 
order n. In this case H has n distinct elements, H = {a , a 1 , . . . , a n_1 }, and a m = e 
iff n\m. In particular, the order of a is the smallest positive integer n with a n = e, 
and / _1 (e) = nZ. 

Proof Suppose j < i and a 1 = a? . Then a* - - 7 = e and thus 3 a smallest positive 
integer n with a n = e. This implies that the elements of {a , a 1 , ..., a n_1 } are distinct, 
and we must show they are all of H . If m G Z, the Euclidean algorithm states that 
3 integers g and r with < r < n and m = nq + r. Thus a m = a ng • a r = a r , and 
so H = {a , a 1 , ...,a n_1 }, and a m = e iff n|m. Later in this chapter we will see that 
/ is a homomorphism from an additive group to a multiplicative group and that, 
in additive notation, H is isomorphic to Z or Z n . 

Exercise Write out this theorem for G an additive group. To begin, suppose a is 
an element of an additive group G, and H = {ai : i G Z}. 

Exercise Show that if G is a finite group of even order, then G has an odd number 
of elements of order 2. Note that e is the only element of order 1. 

Definition A group G is cyclic if 3 an element of G which generates G. 

Theorem If G is cyclic and H is a subgroup of G, then H is cyclic. 

Proof Suppose G = {a 1 : i G Z} is a cyclic group and i/ is a subgroup 
of G. If H = e, then i/ is cyclic, so suppose H ^ e. Now there is a small- 
est positive integer m with a m EH. If £ is an integer with a* G H , then by 
the Euclidean algorithm, m divides t, and thus a m generates H. Note that in 
the case G has finite order n, i.e., G = {o°, a 1 , . . . , a n_1 }, then a n = e E H , 
and thus the positive integer m divides n. In either case, we have a clear picture 
of the subgroups of G. Also note that this theorem was proved on page 15 for the 
additive group Z. 
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Cosets Suppose H is a subgroup of a group G. It will be shown below that H 
partitions G into right cosets. It also partitions G into left cosets, and in general 
these partitions are distinct. 

Theorem If if is a subgroup of a multiplicative group G, then a ~ b defined by 
a ~ b iff a ■ fc _1 G H is an equivalence relation. If a G G, c/(a) = {6 G G : a ~ 6} = 
{h-a:heH} = Ha. Note that a ■ b' 1 e H iff 6 • a -1 G #. 

If iif is a subgroup of an additive group G, then a ~ 6 defined by a ~ 6 iff 
(a — b) £ H is an equivalence relation. If a G G, c/(o) = {6 G G : o ~ 6} = {h + o : 
h E H} = H + a. Note that (a - 6) G H iff (6 - a) G if. 

Definition These equivalence classes are called right cosets. If the relation is 
defined by a ~ b iff 6 _1 • a G H, then the equivalence classes are cl(a) = aH and 
they are called left cosets. if is a left and right coset. If G is abelian, there is no 
distinction between right and left cosets. Note that b~ l ■ a G H iff a -1 • b G H. 

In the theorem above, H is used to define an equivalence relation on G, and thus 
a partition of G. We now do the same thing a different way. We define the right 
cosets directly and show they form a partition of G. You might find this easier. 

Theorem Suppose H is a subgroup of a multiplicative group G. If a G G, define 
the right coset containing a to be Ha = {h ■ a : h E H}. Then the following hold. 

1) Ha = H iff ae H. 

2) If 6 G #a, then ifft = Fa, i.e., iihe H, then #(/i • a) = (Hh)a = Ha. 

3) If Hen Ha f 0, then He = Ha. 

4) The right cosets form a partition of G, i.e., each a in G belongs to one and 

only one right coset. 

5) Elements a and b belong to the same right coset iff a ■ b~ x EH iff b ■ a~ l G H . 

Proof There is no better way to develop facility with cosets than to prove this 
theorem. Also write this theorem for G an additive group. 



Theorem Suppose H is a subgroup of a multiplicative group G. 
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1) Any two right cosets have the same number of elements. That is, if a, b G G, 
f : Ha —y Hb denned by f(h ■ a) — h ■ b is a bijection. Also any two left cosets 
have the same number of elements. Since H is a right and left coset, any 
two cosets have the same number of elements. 

2) G has the same number of right cosets as left cosets. The function F defined 
by F(Ha) = a~ l H is a bijection from the collection of right cosets to the left 
cosets. The number of right (or left) cosets is called the index of H in G. 

3) If G is finite, o(H) (index of H) = o(G) and so o(H) | o(G). In other words, 
o(G)/o(H) = the number of right cosets = the number of left cosets. 

4) If G is finite, and a E G, then o(a) | o(G). (Proof: The order of a is the order 
of the subgroup generated by a, and by 3) this divides the order of G.) 

5) If G has prime order, then G is cyclic, and any element (except e) is a generator. 
(Proof: Suppose o(G) = p and a G G, a ^ e. Then o(a) | p and thus o(a) = p.) 

6) If o{G) = n and o G G, then a n = e. (Proof: a°^ = e and n = o{a) {o{G)/o{a)) .) 



Exercises 

i) Suppose G is a cyclic group of order 4, G = {e, o, a 2 , a 3 } with a 4 = e. Find the 
order of each element of G. Find all the subgroups of G. 

ii) Suppose G is the additive group Z and H = 3Z. Find the cosets of H . 

iii) Think of a circle as the interval [0, 1] with end points identified. Suppose G = R 
under addition and H = Z. Show that the collection of all the cosets of H 
can be thought of as a circle. 

iv) Let G = R 2 under addition, and H be the subgroup defined by 

H = {(a, 2a) : a G R}. Find the cosets of H. (See the last exercise on p 5.) 

Normal Subgroups 



We would like to make a group out of the collection of cosets of a subgroup H . In 
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general, there is no natural way to do that. However, it is easy to do in case H is a 
normal subgroup, which is described below. 

Theorem If H is a subgroup of a group G, then the following are equivalent. 

1) If a G G, then aiJo" 1 = H 

2) If a G G, then aHa- 1 C H 

3) If a G G, then ai/ = Ha 

4) Every right coset is a left coset, i.e., if o G G, 3 b G G with iJa = 6iJ. 

Proof 1) =>■ 2) is obvious. Suppose 2) is true and show 3). We have (aHa~ 1 )a C 

Ha so ai/ C i/a. Also a(a~ 1 Ha) C ai/ so Ha C aiJ. Thus ai/ = Ha. 

3) =>• 4) is obvious. Suppose 4) is true and show 3). Ha = bH contains a, so 

bH = aH because a coset is an equivalence class. Thus aH = Ha. 

Finally, suppose 3) is true and show 1). Multiply aH = Ha on the right by a -1 . 

Definition If H satisfies any of the four conditions above, then H is said to be a 
normal subgroup of G. (This concept goes back to Evariste Galois in 1831.) 

Note For any group G, G and e are normal subgroups. If G is an abelian group, 
then every subgroup of G is normal. 

Exercise Show that if H is a subgroup of G with index 2, then H is normal. 

Exercise Show the intersection of a collection of normal subgroups of G is a 
normal subgroup of G. Show the union of a monotonic collection of normal subgroups 
of G is a normal subgroup of G. 

Exercise Let A C R 2 be the square with vertices (—1,1), (1,1), (1,-1), and 
(—1,-1), and G be the collection of all "isometries" of A onto itself. These are 
bijections of A onto itself which preserve distance and angles, i.e., which preserve dot 
product. Show that with multiplication defined as composition, G is a multiplicative 
group. Show that G has four rotations, two reflections about the axes, and two 
reflections about the diagonals, for a total of eight elements. Show the collection of 
rotations is a cyclic subgroup of order four which is a normal subgroup of G. Show 
that the reflection about the rc-axis together with the identity form a cyclic subgroup 
of order two which is not a normal subgroup of G. Find the four right cosets of this 
subgroup. Finally, find the four left cosets of this subgroup. 
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Quotient Groups Suppose iV is a normal subgroup of G, and C and D are 

cosets. We wish to define a coset E which is the product of C and D. If c G C and 
d G D, define E to be the coset containing c • d, i.e., E = N(c ■ d). The coset E does 
not depend upon the choice of c and d. This is made precise in the next theorem, 
which is quite easy. 

Theorem Suppose G is a multiplicative group, N is a normal subgroup, and 
G/N is the collection of all cosets. Then (No) ■ (Nb) = N(a ■ b) is a well de- 
fined multiplication (binary operation) on G/N, and with this multiplication, G/N 
is a group. Its identity is iV and (Na)~ l = (Na~ l ). Furthermore, if G is finite, 
o(G/N) = o(G)/o(N). 

Proof Multiplication of elements in G/N is multiplication of subsets in G. 
(Na) ■ (Nb) = N(aN)b = N(Na)b = N(a ■ b). Once multiplication is well defined, 
the group axioms are immediate. 

Exercise Write out the above theorem for G an additive group. In the additive 
abelian group R/Z, determine those elements of finite order. 

Example Suppose G = Z under +, n > 1, and TV = riL. Z n , the group of 
integers mod n is defined by Z n = Z/nZ. If a is an integer, the coset a + riL is 
denoted by [a]. Note that [a] + [b] = [a + b], —[a] = [—a], and [a] = [a + nl] for any 
integer /. Any additive abelian group has a scalar multiplication over Z, and in this 
case it is just [a]m = [am]. Note that [a] = [r] where r is the remainder of a divided 
by n, and thus the distinct elements of Z n are [0], [1], ..., [n — 1]. Also Z n is cyclic 
because each of [1] and [—1] = [n — 1] is a generator. We already know that if p is a 
prime, any non-zero element of Z p is a generator, because Z p has p elements. 

Theorem If n > 1 and a is any integer, then [a] is a generator of Z n iff (a, n) = 1. 

Proof The element [a] is a generator iff the subgroup generated by [a] contains 
[1] iff 3 an integer k such that [a]k = [1] iff 3 integers k and I such that ak + nl = 1. 

Exercise Show that a positive integer is divisible by 3 iff the sum of its digits is 
divisible by 3. Note that [10] = [1] in Z 3 . (See the fifth exercise on page 18.) 

Homomorphisms 



Homomorphisms are functions between groups that commute with the group op- 
erations. It follows that they honor identities and inverses. In this section we list 
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the basic properties. Properties 11), 12), and 13) show the connections between coset 
groups and homomorphisms, and should be considered as the cornerstones of abstract 
algebra. As always, the student should rewrite the material in additive notation. 

Definition If G and G are multiplicative groups, a function / : G — > G is a 
homomorphism if, for all a,b G G, f(a ■ b) = /(a) • f(b). On the left side, the group 
operation is in G, while on the right side it is in G. The kernel of / is defined by 
ker(/) = f~ l (e) = {a G G : f(a) = e}. In other words, the kernel is the set of 
solutions to the equation f(x) = e. (If G is an additive group, ker(/) = / _1 (0).) 

Examples The constant map / : G — ► G defined by f(a) = e is a homomorphism. 
If H is a subgroup of G, the inclusion i : H — ► G is a homomorphism. The function 
/ : Z — ► Z defined by f(t) = 2t is a homomorphism of additive groups, while the 
function defined by f(t) = i + 2 is not a homomorphism. The function h : Z — ► R — 
defined by /i(t) = 2* is a homomorphism from an additive group to a multiplicative 
group. 



We now catalog the basic properties of homomorphisms. These will be helpful 
later on in the study of ring homomorphisms and module homomorphisms. 

Theorem Suppose G and G are groups and / : G — > G is a homomorphism. 

1) /(e) = e. 

2) /(a -1 ) = /(a) -1 . The first inverse is in G, and the second is in (5. 

3) / is injective <^> ker(/) = e. 

4) If if is a subgroup of G, f(H) is a subgroup of G. In particular, image(/) is 
a subgroup of (5. 

5) If H is a subgroup of G, f~ l (H) is a subgroup of G. Furthermore, if H is 
normal in (5, then / _1 (i/) is normal in G. 

6) The kernel of / is a normal subgroup of G. 

7) If g G G, / _1 (^) is void or is a coset of ker(/), i.e., if f(g) = g then 
f~ l (g) = Ng where iV= ker(/). In other words, if the equation f(x) = g has a 
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solution, then the set of all solutions is a coset of N= ker(/). This is a key fact 
which is used routinely in topics such as systems of equations and linear 
differential equations. 

8) The composition of homomorphisms is a homomorphism, i.e., if h : G — »G is 
a homomorphism, then h o f : G — >G is a homomorphism. 

9) If / : G — ► G is a bijection, then the function f~ l : G —> G is a homomorphism. 
In this case, / is called an isomorphism, and we write G ~ G. In the case 

G = G, f is also called an automorphism. 

10) Isomorphisms preserve all algebraic properties. For example, if / is an 
isomorphism and H C G is a subset, then iJ is a subgroup of G 

iff /(#) is a subgroup of (5, i7 is normal in G iff f(H) is normal in (5, G is 
cyclic iff (5 is cyclic, etc. Of course, this is somewhat of a cop-out, because an 
algebraic property is one that, by definition, is preserved under isomorphisms. 

11) Suppose H is a normal subgroup of G. Then -k : G — ► G/iJ defined by 
7r(o) = #a is a surjective homomorphism with kernel H. Furthermore, if 
/ : G — ► (5 is a surjective homomorphism with kernel iJ , then G/iJ pa (5 
(see below). 

12) Suppose H is a normal subgroup of G. If H C ker(/), then / : G/iJ — > G 
defined by f(Ha) = f(a) is a well-defined homomorphism making 

the following diagram commute. 

/ 

G -G 



TV 



f 



G/H 



Thus defining a homomorphism on a quotient group is the same as defining a 
homomorphism on the numerator which sends the denominator to e. The 
image of / is the image of / and the kernel of / is ker(f)/H. Thus if H = ker(/), 
/ is injective, and thus G/H fa image(/). 

13) Given any group homomorphism /, domain(/)/ker(/) sa image(/). This is 
the fundamental connection between quotient groups and homomorphisms. 
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14) Suppose K is a group. Then K is an infinite cycle group iff K is isomorphic to 
the integers under addition, i.e., f«Z. K is a cyclic group of order n iff 
K * Z n . 

Proof of 14) Suppose (5 = if is generated by some element a. Then / : Z — > if 
defined by /(m) = a m is a homomorphism from an additive group to a multiplicative 
group. If o(a) is infinite, / is an isomorphism. If o(a) = n, ker(/) = nZ and 
/ : Z n — ► if is an isomorphism. 

Exercise If a is an element of a group G, there is always a homomorphism from Z 
to G which sends 1 to o. When is there a homomorphism from Z n to G which sends [1] 
to a? What are the homomorphisms from Z 2 to Z 6 ? What are the homomorphisms 
from Z 4 to Z 8 ? 

Exercise Suppose G is a group and g is an element of G, g ^ e. 

1) Under what conditions on g is there a homomorphism / : Zj — ► G with 
/([I]) = 5? 

2) Under what conditions on g is there a homomorphism /: Z15 — ► G with 
/([I]) = 5? 

3) Under what conditions on G is there an injective homomorphism / : Z 15 — > G ? 

4) Under what conditions on G is there a surjective homomorphism / : Z 15 — ► G ? 



Exercise We know every finite group of prime order is cyclic and thus abelian. 
Show that every group of order four is abelian. 

Exercise Let G = {h : [0, 1] — ► R : h has an infinite number of derivatives}. 
Then G is a group under addition. Define / : G — ► G by f(h) = % = h! . Show / 
is a homomorphism and find its kernel and image. Let g : [0, 1] — ► R be defined by 
g(t) = i 3 — 3i + 4. Find f~ 1 (g) and show it is a coset of ker(/). 

Exercise Let G be as above and g G G. Define / : G — > G by /(/i) = /i" + 5/i' + 
6t 2 /z. Then / is a group homomorphism and the differential equation h" + 5h' + 6t 2 h = 
g has a solution iff g lies in the image of /. Now suppose this equation has a solution 
and S C G is the set of all solutions. For which subgroup H of G is 5 an ii-coset? 
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Exercise Suppose G is a multiplicative group and a G G. Define / : G — ► G to 
be conjugation by a, i.e., /(g) = a -1 • g ■ a. Show that / is a homomorphism. Also 
show / is an automorphism and find its inverse. 

Permutations 



Suppose A is a (non-void) set. A bijection / : A — > A is called a permutation 
on A, and the collection of all these permutations is denoted by S = S(X). In this 
setting, variables are written on the left, i.e., / = (#)/. Therefore the composition 
fog means "/ followed by g" . S(X) forms a multiplicative group under composition. 

Exercise Show that if there is a bijection between A and Y, there is an iso- 
morphism between 5(A) and S(Y). Thus if each of A and Y has n elements, 
S(X) ~ S(Y), and these groups are called the symmetric groups on n elements. 
They are all denoted by the one symbol S n . 

Exercise Show that o(S n ) = n\. Let A = {1,2, ...,n}, S n = 5(A), and H = 
{/ G S n : (n)f = n}. Show H is a subgroup of 5 n which is isomorphic to 5 n _i. Let 
g be any permutation on A with (n)g = 1. Find g~ 1 Hg. 

The next theorem shows that the symmetric groups are incredibly rich and com- 
plex. 

Theorem (Cayley's Theorem) Suppose G is a multiplicative group with n 
elements and S n is the group of all permutations on the set G. Then G is isomorphic 
to a subgroup of S n . 

Proof Let h : G — > S n be the function which sends a to the bijection h a : G — ► G 
defined by (g)/i a = g ■ a. The proof follows from the following observations. 

1) For each given a, /i a is a bijection from G to C 

2) h is a homomorphism, i.e., h a .b = h a o h^. 

3) h is injective and thus G is isomorphic to image(/i) C 5„. 



The Symmetric Groups Now let n > 2 and let 5„ be the group of all permu- 
tations on {1,2, ...,n}. The following definition shows that each element of S n may 
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be represented by a matrix. 

Definition Suppose 1 < k < n, {a±, a 2 , ..., a^} is a collection of distinct inte- 
gers with 1 < cii < n, and {b±, b 2l ..., bk} is the same collection in some different order. 

Then the matrix represents / G S n defined by (a^)/ = bi for 1 < i < k, 

y bi b 2 ... bk J 

and (a)f = a for all other a. The composition of two permutations is computed by 

applying the matrix on the left first and the matrix on the right second. 

There is a special type of permutation called a cycle. For these we have a special 
notation. 

Definition -••■ - ^ g ca ^ ec j a ^_ C y C l e and is denoted by (01, ao, .... a*.). 

v a 2 a 3 ...a k a x ) y y y k) 

A 2-cycle is called a transposition. The cycles (ai,...,Ofc) and (ci,...,q) are disjoint 

provided Oj 7^ c^ for all 1 < i < k and 1 < j < £. 

Listed here are eight basic properties of permutations. They are all easy except 
4), which takes a little work. Properties 9) and 10) are listed solely for reference. 

Theorem 

1) Disjoint cycles commute. (This is obvious.) 

2) Every nonidentity permutation can be written uniquely (except for order) as 
the product of disjoint cycles. (This is easy.) 

3) Every permutation can be written (non-uniquely) as the product of transposi- 
tions. (Proof: i" = (1,2)(1,2) and (ai, ..., a&) = (a±, 02)(ai, 03) • • • (01, a&). ) 

4) The parity of the number of these transpositions is unique. This means that if 
/ is the product of p transpositions and also of q transpositions, then p is 
even iff q is even. In this case, / is said to be an even permutation. In the other 
case, / is an odd permutation. 

5) A fc-cycle is even (odd) iff k is odd (even). For example (1,2,3) = (1,2)(1,3) is 
an even permutation. 

6) Suppose /, g 6 S n . If one of / and g is even and the other is odd, then g o / is 
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odd. If / and g are both even or both odd, then g o f is even. (Obvious.) 

7) The map h : S n — ► Z 2 defined by /i(even)= [0] and h(odd)= [1] is a 
homomorphism from a multiplicative group to an additive group. Its kernel (the 
subgroup of even permutations) is denoted by A n and is called the alternating 
group. Thus A n is a normal subgroup of index 2, and S n /A n pa Z 2 . 

8) If a, 6, c and d are distinct integers in {1,2, ... ,n}, then (a, 6) (6, c) = (a, c, 6) 
and (a, b)(c, d) = (a, c, d)(a, c, b). Since I = (1, 2, 3) 3 , it follows that for 

n > 3, every even permutation is the product of 3-cycles. 

The following parts are not included in this course. They are presented here merely 
for reference. 

9) For any n^4, A n is simple, i.e., has no proper normal subgroups. 

10) S n can be generated by two elements. In fact, {(1, 2), (1, 2, ..., n)} generates S n . 
(Of course there are subgroups of S n which cannot be generated by two 
elements). 

Proof of 4) It suffices to prove if the product of t transpositions is the identity I 
on {1,2,..., n}, then t is even. Suppose this is false and I is written as t transposi- 
tions, where t is the smallest odd integer this is possible. Since t is odd, it is at least 3. 
Suppose for convenience the first transposition is (a,n). We will rewrite I as a prod- 
uct of transpositions o\02 • • • cr t where (n)<7j = (n) for 1 < i < t and (n)at ^ n, which 
will be a contradiction. This can be done by inductively "pushing n to the right" 
using the equations below. If a,b, and c are distinct integers in {1,2, ... ,n — 1}, 
then (a,n)(a,n) = I, (a,n)(b,n) = (a,b)(a,n), (a,n)(a,c) = (a,c)(c,n), and 
(a,n)(b,c) = (b,c)(a,n). Note that (a,n)(a,n) cannot occur here because it would 
result in a shorter odd product. (Now you may solve the tile puzzle on page viii.) 

Exercise 

1) Write as the product of disjoint cycles. 

Write (1,5,6,7)(2,3,4)(3,7,1) as the product of disjoint cycles. 
Write (3,7,1)(1,5,6,7)(2,3,4) as the product of disjoint cycles. 
Which of these permutations are odd and which are even? 
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2) Suppose (oi, . . . , 0^) and (ci, . . . , q) are disjoint cycles. What is the order of 
their product? 

3) Suppose a G S n . Show that <t _1 (1, 2, 3)ct = ((l)c, (2)<r, (3)<j). This shows 
that conjugation by a is just a type of relabeling. Also let r = (4, 5, 6) and 
findr- 1 (l,2,3,4,5)r. 

4) Show that H = {a G Sq : (6)<j = 6} is a subgroup of Sq and find its right 
cosets and its left cosets. 

5) Let A C R 2 be the square with vertices (—1, 1), (1, 1), (1, —1), and (—1, —1), 
and G be the collection of all isometries of A onto itself. We know from a 
previous exercise that G is a group with eight elements. It follows from Cayley's 
theorem that G is isomorphic to a subgroup of S&. Show that G is isomorphic 
to a subgroup of S^. 

6) If G is a multiplicative group, define a new multiplication on the set G by 

o o b = b ■ a. In other words, the new multiplication is the old multiplication 
in the opposite order. This defines a new group denoted by G op , the opposite 
group. Show that it has the same identity and the same inverses as G, and 
that / : G — ► G op defined by f(a) = a~ l is a group isomorphism. Now consider 
the special case G = S n . The convention used in this section is that an element 
of S n is a permutation on {1, 2, . . . , n} with the variable written on the left. 
Show that an element of 5° p is a permutation on {1,2,..., n} with the variable 
written on the right. (Of course, either S n or S° p may be called the symmetric 
group, depending on personal preference or context.) 

Product of Groups 



The product of groups is usually presented for multiplicative groups. It is pre- 
sented here for additive groups because this is the form that occurs in later chapters. 
As an exercise, this section should be rewritten using multiplicative notation. The 
two theorems below are transparent and easy, but quite useful. For simplicity we 
first consider the product of two groups, although the case of infinite products is only 
slightly more difficult. For background, read first the two theorems on page 11. 

Theorem Suppose G\ and G 2 are additive groups. Define an addition on G± x G 2 
by (oi, a 2 ) + (&i, b 2 ) = (oi + 61,02 + b 2 ). This operation makes G\ x G 2 into a group. 
Its "zero" is (0 1 ,0 2 ) and —(01,02) = (— ai, — 02). The projections 7Ti : G\ x G 2 — ► G\ 
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and 7r 2 : G\ x G2 — ► G2 are group homomorphisms. Suppose G is an additive group. 
We know there is a bijection from {functions / : G — ► Gi x G2} to {ordered pairs of 
functions (/1, A) where /1 : G — ► Gi and f 2 '■ G —>■ G 2 }. Under this bijection, / is a 
group homomorphism iff each of f\ and /2 is a group homomorphism. 

Proof It is transparent that the product of groups is a group, so let's prove 
the last part. Suppose G,Gi, and G2 are groups and / = (fi,f 2 ) is a function 
from G to d x G 2 . Now /(a + b) = (/i(a + b),f 2 {a + 6)) and /(a) + /(&) = 
(/i(a), /2(a)) + {fi{b), /2(b)) = (A (a) + fi{b), / 2 (a) + / 2 (»)- An examination of these 
two equations shows that / is a group homomorphism iff each of fi and f 2 is a group 
homomorphism . 

Exercise Suppose G\ and G2 are groups. Show that G\ x G 2 and G2 x Gi are 
isomorphic. 

Exercise If o(ai) = m and 0(02) = n, find the order of (oi,a 2 ) in Gi x G2. 

Exercise Show that if G is any group of order 4, G is isomorphic to Z 4 or Z 2 xZ 2 . 
Show Z4 is not isomorphic to Z2 x Z2. Show Z12 is isomorphic to Z4 x Z3. Finally, 
show that Z mn is isomorphic to Z m x Z n iff (m,n) = 1. 

Exercise Suppose Gi and G2 are groups and i\ : Gi — ► G\ x G2 is defined by 
^i(5"i) = (#17 02)- Show ix is an injective group homomorphism and its image is a 
normal subgroup of G\ x G2. Usually G\ is identified with its image under i\, so G\ 
may be considered to be a normal subgroup of G\ x G2. Let -k 2 '■ G\ x G2 — *• G2 
be the projection map defined in the Background chapter. Show -k 2 is a surjective 
homomorphism with kernel G\. Therefore [G\ x G 2 )/Gi ~ G2 as you would expect. 



Exercise Let R be the reals under addition. Show that the addition in the 
product R x R is just the usual addition in analytic geometry. 

Exercise Suppose n > 2. Is S n isomorphic to A n x G where G is a multiplicative 
group of order 2 ? 

One nice thing about the product of groups is that it works fine for any finite 
number, or even any infinite number. The next theorem is stated in full generality. 
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Theorem Suppose T is an index set, and for any t G T, G t is an additive 

group. Define an addition on J^Gt = 11 Gt by {a t } + {b t } = {a t + b t }. This op- 

eration makes the product into a group. Its "zero" is {0 t } and — {a t } = {—a t }. 
Each projection ir s : n G t ~^ G s is a group homomorphism. Suppose G is an ad- 
ditive group. Under the natural bijection from {functions / : G — > Y\G t } to 
{sequences of functions {ft}t£T where f t : G — > Gt}, / is a group homomorphism 
iff each f t is a group homomorphism. Finally, the scalar multiplication on Yl Gt 
by integers is given coordinatewise, i.e., {a t }n = {atn}. 

Proof The addition on TJ G t is coordinatewise. 

Exercise Suppose s is an element of T and n s : TJ G t —>■ G s is the projection map 
defined in the Background chapter. Show ti s is a surjective homomorphism and find 
its kernel. 

Exercise Suppose s is an element of T and i s : G s — ► TJ Gt is defined by i s (a) = 
{at} where Ot = if £ 7^ s and a s = a. Show i s is an injective homomorphism 
and its image is a normal subgroup of Y\G t - Thus each G s may be considered to be 
a normal subgroup of Y\G t - 

Exercise Let / : Z — > Z30 x Z100 be the homomorphism defined by f(m) = 
([4m], [3m]). Find the kernel of /. Find the order of ([4], [3]) in Z 30 x Z 100 . 

Exercise Let / : Z — > Z90 x Z70 x Z42 be the group homomorphism defined by 
/(m) = ([m], [m], [m]). Find the kernel of / and show that / is not surjective. Let 
g : Z — ► Z45 x Z35 x Z21 be defined by ^(m) = ([m], [m], [m]). Find the kernel of 
g and determine if g is surjective. Note that the gcd of {45,35,21} is 1. Now let 
h : Z — > Z§ x Z9 x Z35 be defined by /i(m) = ([m], [m], [m]). Find the kernel of h 
and show that h is surjective. Finally suppose each of 6, c, and d is greater than 1 
and / : Z — ► Z;, x Z c x Z^ is defined by /(m) = ([m], [m], [m]). Find necessary and 
sufficient conditions for / to be surjective (see the first exercise on page 18). 

Exercise Suppose T is a non-void set, G is an additive group, and G T is the 
collection of all functions / : T — ► G with addition defined by (f + g)(i) = f(t) +g(t). 
Show G T is a group. For each t G T, let Gt = G. Note that G T is just another way 
of writing TT G t . Also note that if T = [0, 1] and G = R, the addition defined on 

teT 
G T is just the usual addition of functions used in calculus. (For the ring and module 
versions, see exercises on pages 44 and 69.) 



Chapter 3 

Rings 



Rings are additive abelian groups with a second operation called multiplication. The 
connection between the two operations is provided by the distributive law. Assuming 
the results of Chapter 2, this chapter flows smoothly. This is because ideals are also 
normal subgroups and ring homomorphisms are also group homomorphisms. We do 
not show that the polynomial ring F[x] is a unique factorization domain, although 
with the material at hand, it would be easy to do. Also there is no mention of prime 
or maximal ideals, because these concepts are unnecessary for our development of 
linear algebra. These concepts are developed in the Appendix. A section on Boolean 
rings is included because of their importance in logic and computer science. 

Suppose R is an additive abelian group, R ^ 0, and R has a second binary 
operation (i.e., map from R x R to R) which is denoted by multiplication. Consider 
the following properties. 

1) If a, b, c G R, (a ■ b) ■ c = a ■ (b ■ c). (The associative property 
of multiplication.) 

2) If a, b, c G R, a ■ {b + c) = (a • b) + (a • c) and (b + c) ■ a = (b ■ a) + (c • a) 
(The distributive law, which connects addition and 
multiplication.) 

3) R has a multiplicative identity, i.e., there is an element 
1 = 1 R G R such that if a G R, a ■ 1 = 1 • a = a. 

4) If a, b G R, a ■ b = b ■ a. (The commutative property for 
multiplication.) 

Definition If 1), 2), and 3) are satisfied, R is said to be a ring. If in addition 4) 
is satisfied, R is said to be a commutative ring. 

Examples The basic commutative rings in mathematics are the integers Z, the 
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rational numbers Q, the real numbers R, and the complex numbers C. It will be shown 
later that Z„, the integers mod n, has a natural multiplication under which it is a 
commutative ring. Also if R is any commutative ring, we will define R[xi, £2, • ■ ■ , x n ], 
a polynomical ring in n variables. Now suppose R is any ring, n > 1, and R n is the 
collection of all n x n matrices over R. In the next chapter, operations of addition and 
multiplication of matrices will be defined. Under these operations, R n is a ring. This 
is a basic example of a non-commutative ring. If n > 1, R n is never commutative, 
even if R is commutative. 

The next two theorems show that ring multiplication behaves as you would wish 
it to. They should be worked as exercises. 

Theorem Suppose R is a ring and a, b G R. 

1) a • = • a = 0. Since R ^ 0, it follows that 1^0. 

2) (-a)-b = a-(-b) = -(a-b). 



Recall that, since R is an additive abelian group, it has a scalar multiplication 
over Z (page 20). This scalar multiplication can be written on the right or left, i.e., 
na = an, and the next theorem shows it relates nicely to the ring multiplication. 

Theorem Suppose a,b G R and n, m G Z. 

1) (na) ■ (mb) = (nm)(a ■ b). (This follows from the distributive 
law and the previous theorem.) 

2) Let n = n\. For example, 2 = 1 + 1. Then na = n • a, that is, scalar 
multiplication by n is the same as ring multiplication by n. 

Of course, n may be even though n^O. 

Units 



Definition An element a of a ring R is a unit provided 3 an element a l G R 
with a ■ a -1 = a -1 • a = 1. 

Theorem can never be a unit. 1 is always a unit. If o is a unit, a -1 is also a 
unit with (o -1 ) -1 = a. The product of units is a unit with (a • fe) _1 = b~ l ■ a~ l . More 
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generally, if Oi, 02, ..., a n are units, then their product is a unit with (01 • 02 • • ■ On) -1 = 
a" 1 • a~^ x • • • a^ 1 . The set of all units of R forms a multiplicative group denoted by 
R*. Finally if a is a unit, (—a) is a unit and (—a) -1 = —(a -1 ). 

In order for a to be a unit, it must have a two-sided inverse. It suffices to require 
a left inverse and a right inverse, as shown in the next theorem. 

Theorem Suppose a G R and 3 elements b and c with b ■ a = a ■ c = 1. Then 
6 = c and so a is a unit with o _1 = b = c. 

Proof b = b ■ 1 = b ■ (a ■ c) = (b ■ a) ■ c = 1 • c = c. 

Corollary Inverses are unique. 



Domains and Fields In order to define these two types of rings, we first consider 
the concept of zero divisor. 

Definition Suppose R is a commutative ring. An element a G R is called a zero 
divisor provided it is non-zero and 3 a non-zero element b with a ■ b = 0. Note that 
if a is a unit, it cannot be a zero divisor. 

Theorem Suppose R is a commutative ring and a G (R — 0) is not a zero divisor. 
Then (a • b = a ■ c) =>■ 6 = c. In other words, multiplication by a is an injective map 
from R to R. It is surjective iff a is a unit. 

Definition A domain (or integral domain) is a commutative ring such that, if 
a 7^ 0, a is not a zero divisor. A field is a commutative ring such that, if o 7^ 0, a is 
a unit. In other words, R is a field if it is commutative and its non-zero elements 
form a group under multiplication. 

Theorem A field is a domain. A finite domain is a field. 

Proof A field is a domain because a unit cannot be a zero divisor. Suppose R is 
a finite domain and fl^O. Then / : R —>■ R defined by f(b) = a ■ b is injective and, 
by the pigeonhole principle, / is surjective. Thus a is a unit and so R is a field. 



40 Rings Chapter 3 

Exercise Let C be the additive abelian group R 2 . Define multiplication by 
(a, 6) • (c, d) = (ac — bd, ad + be). Show C is a commutative ring which is a field. 
Note that 1 = (1,0) and if i = (0, 1), then i 2 = -1. 

Examples Z is a domain. Q, R, and C are fields. 

The Integers Mod n 



The concept of integers mod n is fundamental in mathematics. It leads to a neat 
little theory, as seen by the theorems below. However, the basic theory cannot be 
completed until the product of rings is defined. (See the Chinese Remainder Theorem 
on page 50.) We know from page 27 that Z n is an additive abelian group. 

Theorem Suppose n > 1. Define a multiplication on Z n by [a] ■ [b] = [ab]. This 
is a well defined binary operation which makes Z n into a commutative ring. 

Proof Since [a + kn] ■ [b + In] = [ab + n(al + bk + kin)] = [ab], the multiplication 
is well-defined. The ring axioms are easily verified. 

Theorem Suppose n > 1 and o G Z. Then the following are equivalent. 

1) [a] is a generator of the additive group Z n . 

2) (o,n) = l. 

3) [a] is a unit of the ring Z n . 

Proof We already know from page 27 that 1) and 2) are equivalent. Recall that 
if b is an integer, [a]b = [a] ■ [b] = [ab]. Thus 1) and 3) are equivalent, because each 
says 3 an integer b with [a]b = [1]. 

Corollary If n > 1, the following are equivalent. 

1) Z n is a domain. 

2) Z n is a field. 

3) n is a prime. 

Proof We already know 1) and 2) are equivalent, because Z„ is finite. Suppose 
3) is true. Then by the previous theorem, each of [1], [2],...,[n — 1] is a unit, and 
thus 2) is true. Now suppose 3) is false. Then n = ab where l<o<n, 1 < b < n, 
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[a] [b] = [0], and thus [a] is a zero divisor and 1) is false. 

Exercise List the units and their inverses for Z 7 and Z 12 . Show that (Z 7 )* is 

a cyclic group but (Z12)* is not. Show that in Z12 the equation x 2 = 1 has four 
solutions. Finally show that if R is a domain, x 2 = 1 can have at most two solutions 
in R (see the first theorem on page 46). 



Subrings Suppose S is a subset of a ring 7?. The statement that S is a subring 
of 7? means that S is a subgroup of the group R, 1 € S , and (a, 6 e 5 =>■ a • 6 e 5). 
Then clearly S is a ring and has the same multiplicative identity as R. Note that Z 
is a subring of Q, Q is a subring of R, and R is a subring of C. Subrings do not play 
a role analogous to subgroups. That role is played by ideals, and an ideal is never a 
subring (unless it is the entire ring). Note that if S is a subring of R and s € S, then 
s may be a unit in R but not in S. Note also that Z and Z n have no proper subrings, 
and thus occupy a special place in ring theory, as well as in group theory. 

Ideals and Quotient Rings 



Ideals in ring theory play a role analagous to normal subgroups in group theory. 

f left 1 

Definition A subset 7 of a ring R is a < right > ideal provided it is a subgroup 

[ 2-sided J 

' a-bel ] 

of the additive group i? and if a G i? and b E I, then < 6 • o G 7 > . The 

a • b and b ■ a e I J 
word "ideal " means "2-sided ideal" . Of course, if R is commutative, every right or 
left ideal is an ideal. 

Theorem Suppose 7? is a ring. 

1) R and are ideals of R. These are called the improper ideals. 

2) If {7 t } tgT is a collection of right (left, 2-sided) ideals of 7?, then ["] I t is a 
right (left, 2-sided) ideal of R. (See page 22.) 
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3) Furthermore, if the collection is monotonic, then I) I t is a right (left, 2-sided) 
ideal of R. 

4) If a G i?, / = oi? is a right ideal. Thus if R is commutative, ai? is an ideal, 
called a principal ideal. Thus every subgroup of Z is a principal ideal, 
because it is of the form riL. 

5) If R is a commutative ring and I C R is an ideal, then the following are 
equivalent. 

i) I = R. 

ii) I contains some unit u. 

iii) I contains 1. 

Exercise Suppose R is a commutative ring. Show that R is a field iff R contains 
no proper ideals. 

The following theorem is just an observation, but it is in some sense the beginning 
of ring theory. 

Theorem Suppose R is a ring and I C R is an ideal, I ^ R. Since I is a normal 
subgroup of the additive group R, R/I is an additive abelian group. Multiplication 
of cosets defined by (a + I) ■ (b + I) = (ab + I) is well-defined and makes R/I a ring. 

Proof (o + I) ■ (b + I) = a ■ b + al + lb + II C a ■ b + I. Thus multiplication 
is well defined, and the ring axioms are easily verified. The multiplicative identity is 
(! + ')■ 

Observation If R = Z, n > 1, and J = nZ, the ring structure on Z n = Z/nZ 
is the same as the one previously defined. 

Homomorphisms 



Definition Suppose R and R are rings. A function / : R — ► i? is a rinj homo- 
morphism provided 

1) / is a group homomorphism 

2) /(!«) = 1^ and 

3) if a, b £ R then /(a • b) = /(a) • /(&). (On the left, multiplication 
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is in R, while on the right multiplication is in R.) 

The kernel of / is the kernel of / considered as a group homomorphism, namely 
ker(/) = f-\0). 



Here is a list of the basic properties of ring homomorphisms. Much of this 
work has already been done by the theorem in group theory on page 28. 

Theorem Suppose each of R and R is a ring. 

1) The identity map Ir : R — ► R is a ring homomorphism. 

2) The zero map from R to R is not a ring homomorphism 
(because it does not send 1 R to 1^). 

3) The composition of ring homomorphisms is a ring homomorphism. 

4) If / : R — > R is a bijection which is a ring homomorphism, 
then f~ l : R — > R is a ring homomorphism. Such an / is called 
a ring isomorphism. In the case R = R, f is also called a 
rmgf automorphism. 

5) The image of a ring homomorphism is a subring of the range. 

6) The kernel of a ring homomorphism is an ideal of the domain. 
In fact, if / : R — ► R is a homomorphism and / C R is an ideal, 
then f~ l (I) is an ideal of R. 

7) Suppose / is an ideal of R, I ^ R, and it : R — ► i?/7 is the 
natural projection, 7r(a) = (a + J). Then 7r is a surjective ring 
homomorphism with kernel J. Furthermore, if / : R — ► ^ is a surjective 
ring homomorphism with kernel /, then i?// pa ^ (see below). 

8) From now on the word "homomorphism" means "ring homomorphism" . 
Suppose / : R — > R is a homomorphism and / is an ideal of R, I ^ R. 
If / C ker(/), then / : R/I -»■ i? defined by /(o + J) = /(a) 
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is a well-defined homomorphism making the following diagram commute. 

/ 
R - R 



TV 



I 



R/I 



Thus denning a homomorphism on a quotient ring is the same as 
denning a homomorphism on the numerator which sends the 
denominator to zero. The image of / is the image of /, and 
the kernel of / is ker(/)/J. Thus if I = ker(/), / is 
injective, and so R/I pa image (/). 

Proof We know all this on the group level, and it is only necessary 
to check that / is a ring homomorphism, which is obvious. 

9) Given any ring homomorphism /, domain(/)/ker(/) ~ image(/). 

Exercise Find a ring R with a proper ideal / and an element b such that b is not 
a unit in R but (b + I) is a unit in R/I. 

Exercise Show that if u is a unit in a ring R, then conjugation by u is an 
automorphism on R. That is, show that / : R — ► R defined by /(a) = -u -1 ■ a ■ u is 
a ring homomorphism which is an isomorphism. 

Exercise Suppose T is a non-void set, R is a ring, and R T is the collection of 
all functions f : T —> R. Define addition and multiplication on R T point-wise. This 
means if / and g are functions from T to R, then (/ + g)(t) = f(t) + g(t) and 
(/ ' d){t) = f(t)g(t)- Show that under these operations R T is a ring. Suppose S is a 
non-void set and a : S — ► T is a function. If / : T — ► i? is a function, define a function 
a*(f) : S ^ R by cc*(f) = f o a. Show a* : i? T — ► R s is a ring homomorphism. 

Exercise Now consider the case T = [0, 1] and R = R. Let ^4 C R' ' 1 ' be the 
collection of all C°° functions, i.e., A ={/ : [0, 1] — > R : / has an infinite number of 
derivatives}. Show A is a ring. Notice that much of the work has been done in the 
previous exercise. It is only necessary to show that A is a subring of the ring R' ' 1 '. 
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Polynomial Rings 

In calculus, we consider real functions / which are polynomials, f(x) = a$ + aiX + 
■ ■ +a n x n . The sum and product of polynomials are again polynomials, and it is easy 
to see that the collection of polynomial functions forms a commutative ring. We can 
do the same thing formally in a purely algebraic setting. 

Definition Suppose R is a commutative ring and x is a "variable" or "symbol" . 
The polynomial ring R[x] is the collection of all polynomials / = ao + ci\x + • • +a n x n 
where a, t G R. Under the obvious addition and multiplication, R[x] is a commutative 
ring. The degree of a non-zero polynomial / is the largest integer n such that a n ^ 0, 
and is denoted by n = deg(/). If the top term a n = 1, then / is said to be monic. 

To be more formal, think of a polynomial ao + a\X + • • • as an infinite sequence 
(ao, Oi, ...) such that each a^ G R and only a finite number are non-zero. Then 
(a , ai, ...) + (b , h, ...) = (a + b , a 1 + b 1 , ...) and 
(a , ai, ...) • (b , Oi, ...) = (a b , a bi + aib , a b 2 + 0161 + a 2 &o, ■■■)■ 
Note that on the right, the ring multiplication a ■ b is written simply as ab, as is 
often done for convenience. 



Theorem If R is a domain, R[x] is also a domain. 

Proof Suppose / and g are non-zero polynomials. Then deg(/)+deg(g) = deg(fg) 
and thus fg is not 0. Another way to prove this theorem is to look at the bottom 
terms instead of the top terms. Let aix 1 and bjX^ be the first non-zero terms of / 
and g. Then aibjX l+: > is the first non-zero term of fg. 

Theorem (The Division Algorithm) Suppose R is a commutative ring, / G 

R[x] has degree > 1 and its top coefficient is a unit in R. (If R is a field, the 
top coefficient of / will always be a unit.) Then for any g G R[x], 3! h,r G R[x] 
such that g = fh + r with r = or deg(r) < deg(/). 

Proof This theorem states the existence and uniqueness of polynomials h and 
r. We outline the proof of existence and leave uniqueness as an exercise. Suppose 
/ = ao + d\X + • • +a m x m where m > 1 and a m is a unit in R. For any g with 
deg(g) < m, set h = and r = g. For the general case, the idea is to divide / into g 
until the remainder has degree less than m. The proof is by induction on the degree 
of g. Suppose n > m and the result holds for any polynomial of degree less than 
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n. Suppose g is a polynomial of degree n. Now 3 a monomial bx l with t = n — m 
and deg(g — fbx f ) < n. By induction, 3 h\ and r with //ii + r = (g — fbx 1 ) and 
deg(r) < m. The result follows from the equation f(h\ + bx f ) + r = g. 

Note If r = we say that / divides g. Note that f = x — c divides g iff c is 
a root of g, i.e., g(c) = 0. More generally, x — c divides g with remainder g(c). 

Theorem Suppose R is a domain, n > 0, and g(x) = a + aix + • • • + a n x n is a 
polynomial of degree n with at least one root in R. Then g has at most n roots. Let 
Ci, C2, .., Cfc be the distinct roots of g in the ring i?. Then 3 a unique sequence of 
positive integers ni,ri2, ■■, rik and a unique polynomial h with no root in R so that 
<7(x) = (x — C\) ni ■ ■ ■ (x — Ck) nk h(x). (If h has degree 0, i.e., if h = a n , then we say 
"all the roots of g belong to R v . If g = a n x n , we say "all the roots of g are 0" .) 

Proof Uniqueness is easy so let's prove existence. The theorem is clearly true 
for n = 1. Suppose n > 1 and the theorem is true for any polynomial of degree less 
than n. Now suppose g is a polynomial of degree n and c\ is a root of g. Then 3 
a polynomial h\ with g(x) = (x — C\)h\. Since ft^ has degree less than n, the result 
follows by induction. 

Note If g is any non-constant polynomial in C[x], all the roots of g belong to C, 
i.e., C is an algebraically closed field. This is called The Fundamental Theorem of 
Algebra, and it is assumed without proof for this textbook. 

Exercise Suppose g is a non-constant polynomial in R[x]. Show that if g has 
odd degree then it has a real root. Also show that if g(x) = x 2 + bx + c, then it has 
a real root iff b 2 > 4c, and in that case both roots belong to R. 

Definition A domain T is a principal ideal domain (PID) if, given any ideal /, 
3 t G T such that / = tT. Note that Z is a PID and any field is PID. 

Theorem Suppose F is a field, / is a proper ideal of F[x], and n is the smallest 
positive integer such that I contains a polynomial of degree n. Then / contains a 
unique polynomial of the form / = ao + a±x + • • +a n -\X n ~ 1 + x n and it has the 
property that / = fF\x\. Thus F[x] is a PID. Furthermore, each coset of I can be 
written uniquely in the form (co + C\X + • • +c n -\X n ~ 1 + /). 

Proof. This is a good exercise in the use of the division algorithm. Note this is 

similar to showing that a subgroup of Z is generated by one element (see page 15). 
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Theorem. Suppose R is a subring of a commutative ring C and c E C. Then 

3! homomorphism h : R[x] — ► C with /t(x) = c and /i(r) = r for all r E R. It is 
defined by h(a + aire + • • +a n x n ) = ao + aic + • • +a n c n , i.e., h sends /(#) to /(c). 
The image of h is the smallest subring of C containing R and c. 

This map h is called an evaluation map. The theorem says that adding two 
polynomials in R[x] and evaluating is the same as evaluating and then adding in C. 
Also multiplying two polynomials in R[x] and evaluating is the same as evaluating 
and then multiplying in C . In street language the theorem says you are free to send 
x wherever you wish and extend to a ring homomorphism on R[x\. 

Exercise Let C = {o + bi : a, b G R}. Since R is a subring of C, there exists a 
homomorphism h : R[x] — > C which sends x to i, and this h is surjective. Show 
kex(h) = (x 2 + l)R[a;] and thus R[z]/(a; 2 + 1) ~ C. This is a good way to look 
at the complex numbers, i.e., to obtain C, adjoin x to R and set x 2 = — 1. 

Exercise Z 2 [x]/(x 2 + x + 1) has 4 elements. Write out the multiplication table 
for this ring and show that it is a field. 

Exercise Show that, if R is a domain, the units of R[x] are just the units of R. 
Thus if F is a field, the units of F[x] are the non-zero constants. Show that [1] + [2]x 
is a unit in ZJx]. 



In this chapter we do not prove F[x] is a unique factorization domain, nor do 
we even define unique factorization domain. The next definition and theorem are 
included merely for reference, and should not be studied at this stage. 

Definition Suppose F is a field and / G F[x] has degree > 1. The statement 
that g is an associate of / means 3 a unit u G F[x] such that g = uf . The statement 
that / is irreducible means that if h is a non-constant polynomial which divides /, 
then h is an associate of /. 

We do not develop the theory of F[x] here. However, the development is easy 
because it corresponds to the development of Z in Chapter 1. The Division Algo- 
rithm corresponds to the Euclidean Algorithm. Irreducible polynomials correspond 
to prime integers. The degree function corresponds to the absolute value function. 
One difference is that the units of F[x] are non-zero constants, while the units of Z 
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are just ±1. Thus the associates of / are all cf with c^O while the associates of an 
integer n are just ±n. Here is the basic theorem. (This theory is developed in full in 
the Appendix under the topic of Euclidean domains.) 

Theorem Suppose F is a field and / G F[x] has degree > 1. Then / factors as the 
product of irreducibles, and this factorization is unique up to order and associates. 
Also the following are equivalent. 

1) F[x]/(f) is a domain. 

2) F[x]/{f) is a field. 

3) / is irreducible. 



Definition Now suppose x and y are "variables". If a G R and n,m > 0, then 

ax n y m = ay m x n is called a monomial. Define an element of R[x, y] to be any finite 
sum of monomials. 

Theorem R[x, y] is a commutative ring and (i?[x])[y] ~ R[x, y] ~ (i?[y])[x]. In 
other words, any polynomial in x and y with coefficients in R may be written as a 
polynomial in y with coefficients in R[x], or as a polynomial in x with coefficients in 

R[y}- 

Side Comment It is true that if F is a field, each / G F[x, y] factors as the 
product of irreducibles. However F[x, y] is not a P1D. For example, the ideal 
/ = xF[x, y] + yF[x, y] = {/ G F[x, y] : /(Q, 0) = 0} is not principal. 

If i? is a commutative ring and n > 2, the concept of a polynomial ring in 
n variables works fine without a hitch. If a G R and i>i,i>2, ...,t>n are non-negative 
integers, then ax^ 1 x% 2 • ■ • x^ n is called a monomial. Order does not matter here. 
Define an element of R[xi, X2, ■■■, x n ] to be any finite sum of monomials. This 
gives a commutative ring and there is canonical isomorphism R[xi, X2, ■■■, x n ] ~ 
(R[xi,X2, ...,x n _i])[x n ]. Using this and induction on n, it is easy to prove the fol- 
lowing theorem. 

Theorem If R is a domain, R[xi, X2, ■■■, x n ] is a domain and its units are just the 
units of R. 
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Exercise Suppose R is a commutative ring and / : R[x, y] — ► R[x] is the eval- 
uation map which sends y to 0. This means f(p(x,y)) = p(x,Q). Show / is a ring 
homomorphism whose kernel is the ideal (y) = yR[x,y]. Use the fact that "the do- 
main mod the kernel is isomorphic to the image" to show R[x,y]/(y) is isomorphic 
to R[x\. That is, if you adjoin y to R[x] and then factor it out, you get R[x] back. 

Product of Rings 



The product of rings works fine, just as does the product of groups. 

Theorem Suppose T is an index set and for each t E T, R t is a ring. On the 
additive abelian group TT R t = Y\Rt, define multiplication by {r t } ■ {s t } = {r t ■ s t }. 

Then n Rt is a ring and each projection n s : n Rt —> Rs is a r i n g homomorphism. 
Suppose R is a ring. Under the natural bijection from {functions / : R — ► n Rt} 
to {sequences of functions {ft}t&T where f t :R—y R t }, f is a ring homomorphism 
iff each f t is a ring homomorphism. 

Proof We already know / is a group homomorphism iff each f t is a group homo- 
morphism (see page 36). Note that {l t } is the multiplicative identity of n Rt, and 
/{Ir) — {It} iff /t(li?) = It f° r each t E T. Finally, since multiplication is defined 
coordinatewise, / is a ring homomorphism iff each f t is a ring homomorphism. 

Exercise Suppose R and S are rings. Note that R x is not a subring of R x S 
because it does not contain (1 R , l s ). Show R x is an ideal and (R x S/R x 0) ~ S. 
Suppose I G R and J C S are ideals. Show I x J is an ideal of R x S and every 
ideal of R x S is of this form. 

Exercise Suppose R and S are commutative rings. Show T = R x S is not a 
domain. Let e = (1, 0) E R x S and show e 2 = e, (I — e) 2 = (I — e), i? x = eT, 
and x S = (1 - e)T. 

Exercise If T is any ring, an element e of T is called an idempotent provided 
e 2 = e. The elements and 1 are idempotents called the trivial idempotents. Suppose 
T is a commutative ring and e E T is an idempotent with ^ e ^ 1 . Let R = eT 
and 5 = (1 — e)T. Show each of the ideals i? and 5 is a ring with identity, and 
/ : T — ► i? x 5* defined by /(£) = (et, (1— e)t) is a ring isomorphism. This shows that 
a commutative ring T splits as the product of two rings iff it contains a non-trivial 
idempotent. 
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The Chinese Remainder Theorem 

The natural map from Z to Z m x Z n is a group homomorphism and also a ring 
homomorphism. If m and n are relatively prime, this map is surjective with kernel 
mnZ, and thus Z mn and Z m x Z„ are isomorphic as groups and as rings. The next 
theorem is a classical generalization of this. (See exercise three on page 35.) 

Theorem Suppose ni,...,n t are integers, each rii > 1, and (ni,rij) = 1 for all 
i 7^ j. Let fi : Z — ► Z n% be defined by fi(a) = [a]. (Note that the bracket symbol is 
used ambiguously.) Then the ring homomorphism / = (/ 1; .., f t ) : Z — ► Z ni x • • xZ nt 
is surjective. Furthermore, the kernel of / is nZ, where n = riiri2 ■ ■ n t . Thus Z„ 
and Z ni x • • xZ ni are isomorphic as rings, and thus also as groups. 

Proof We wish to show that the order of /(l) is n, and thus /(l) is a group 
generator, and thus / is surjective. The element f(l)m = ([1], .., [l])m = ([m], .., [m]) 
is zero iff m is a multiple of each of n\, .., n t . Since their least common multiple is n, 
the order of /(l) is n. (See the fourth exercise on page 36 for the case t = 3.) 

Exercise Show that if a is an integer and p is a prime, then [a] = [a p ] in Z p 
(Fermat's Little Theorem). Use this and the Chinese Remainder Theorem to show 
that if b is a positive integer, it has the same last digit as b 5 . 

Characteristic 



The following theorem is just an observation, but it shows that in ring theory, the 
ring of integers is a "cornerstone" . 

Theorem If R is a ring, there is one and only one ring homomorphism / : Z — ► R. 
It is given by f(m) = ml = m. Thus the subgroup of R generated by 1 is a subring 
of R isomorphic to Z or isomorphic to Z n for some positive integer n. 

Definition Suppose R is a ring and / : Z — ► R is the natural ring homomor- 
phism f(m) = ml = m. The non-negative integer n with ker(/) = riL is called the 
characteristic of R. Thus / is injective iff R has characteristic iff 1 has infinite 
order. If / is not injective, the characteristic of R is the order of 1. 

It is an interesting fact that, if R is a domain, all the non-zero elements of R 
have the same order. (See page 23 for the definition of order.) 
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Theorem Suppose R is a domain. If R has characteristic 0, then each non-zero 
a G R has infinite order. If R has finite characteristic n, then n is a prime and each 
non-zero a G R has order n. 

Proof Suppose R has characteristic 0, a is a non-zero element of R, and m is a 
positive integer. Then ma = m • a cannot be because m,a/0 and R is a domain. 
Thus o(a) = oo. Now suppose i? has characteristic n. Then i? contains Z n as a 
subring, and thus Z„ is a domain and n is a prime. If a is a non-zero element of R, 
na = n • a = 0-a =0 and thus o(a)\n and thus o(a) = n. 

Exercise Show that if F is a field of characteristic 0, F contains Q as a subring. 
That is, show that the injective homomorphism / : Z — ► F extends to an injective 
homomorphism / : Q — ► F. 

Boolean Rings 



This section is not used elsewhere in this book. However it fits easily here, and is 
included for reference. 

Definition A ring R is a Boolean ring if for each a G R, a 2 = a, i.e., each 

element of R is an idempotent. 

Theorem Suppose R is a Boolean ring. 

1) R has characteristic 2. If a G R, 2o = a + a = 0, and so a = —a. 
Proof (o + o) = (a + a) 2 = a 2 + 2o 2 + a 2 = 4a. Thus 2o = 0. 

2) R is commutative. 

Proof (a + 6) = (a + 6) 2 = a 2 + (a • 6) + (6 • a) + 6 2 
= a + (o • 6) — (6 • a) + 6. Thus a ■ b = b ■ a. 

3) If i? is a domain, i? pa Z 2 . 

Proof Suppose a^O. Then o • (1 — o) = and so a = 1. 

4) The image of a Boolean ring is a Boolean ring. That is, if I is an ideal 
of R with I ^ R, then every element of R/I is idempotent and thus 
R/I is a Boolean ring. It follows from 3) that R/I is a domain iff R/I 
is a field iff R/I fa Z 2 . (In the language of Chapter 6, I is a prime 
ideal iff I is a maximal ideal iff R/I m Z 2 ). 
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Suppose X is a no n- void set. If a is a subset of X , let a' = (X — a) be a complement 
of a in X. Now suppose i? is a no n- void collection of subsets of X. Consider the 
following properties which the collection R may possess. 

1) a G R =^ a' G R. 

2) a,be R ^ (anb) e R. 

3) a, 6 G i? =^ (oU6) G i?. 

4) G i? and X G R. 

Theorem If 1) and 2) are satisfied, then 3) and 4) are satisfied. In this case, R 
is called a Boolean algebra of sets. 

Proof Suppose 1) and 2) are true, and a,b G R. Then a U b = (a' fl 6')' belongs to 
i? and so 3) is true. Since i? is non-void, it contains some element a. Then = a n a' 
and X = a U a' belong to R, and so 4) is true. 

Theorem Suppose R is a Boolean algebra of sets. Define an addition on R by 
a + b = (a U b) — (o fl b). Under this addition, R is an abelian group with = and 
a = —a. Define a multiplication on R by o • b = a n 6. Under this multiplication i? 
becomes a Boolean ring with 1 = X. 

Exercise Let X = {1,2, ...,n} and let R be the Boolean ring of all subsets of 
X. Note that o(R) = 2 n . Define f t : R -> Z 2 by fi(a) = [1] iff i G a. Show each 
/i is a homomorphism and thus / = (/i, ..., / n ) : -R — ► Z 2 x Z 2 x • • xZ 2 is a ring 
homomorphism. Show / is an isomorphism. (See exercises 1) and 4) on page 12.) 

Exercise Use the last exercise on page 49 to show that any finite Boolean ring is 
isomorphic to Z 2 x Z 2 x • • xZ 2 , and thus also to the Boolean ring of subsets above. 

Note Suppose R is a Boolean ring. It is a classical theorem that 3 a Boolean 
algebra of sets whose Boolean ring is isomorphic to R. So let's just suppose R is 
a Boolean algebra of sets which is a Boolean ring with addition and multiplication 
defined as above. Now define a V b = a U b and a A b = a fl b. These operations cup 
and cap are associative, commutative, have identity elements, and each distributes 
over the other. With these two operations (along with complement), R is called a 
Boolean algebra. R is not a group under cup or cap. Anyway, it is a classical fact 
that, if you have a Boolean ring (algebra), you have a Boolean algebra (ring). The 
advantage of the algebra is that it is symmetric in cup and cap. The advantage of 
the ring viewpoint is that you can draw from the rich theory of commutative rings. 
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Matrices and Matrix Rings 



We first consider matrices in full generality, i.e., over an arbitrary ring R. However, 
after the first few pages, it will be assumed that R is commutative. The topics, 
such as invertible matrices, transpose, elementary matrices, systems of equations, 
and determinant, are all classical. The highlight of the chapter is the theorem that a 
square matrix is a unit in the matrix ring iff its determinant is a unit in the ring. 
This chapter concludes with the theorem that similar matrices have the same deter- 
minant, trace, and characteristic polynomial. This will be used in the next chapter 
to show that an endomorphism on a finitely generated vector space has a well-defined 
determinant, trace, and characteristic polynomial. 

Definition Suppose R is a ring and m and n are positive integers. Let R m , n be 
the collection of all m x n matrices 



A = (aij) 



/ Oi i . . . a,\ n \ 



\ 0"m,l • • • Q"m,n J 



where each entry a« ,• G R. 



A matrix may be viewed as m n-dimensional row vectors or as n m-dimensional 
column vectors. A matrix is said to be square if it has the same number of rows 
as columns. Square matrices are so important that they have a special notation, 
R n = R n , n . R n is defined to be the additive abelian group R x R x • • • x R. 
To emphasize that R n does not have a ring structure, we use the "sum" notation, 
R n = R © R © • • • © R. Our convention is to write elements of R n as column vectors, 
i.e., to identify R n with R n ^. If the elements of R n are written as row vectors, R n is 
identified with R\ n . 
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Addition of matrices To "add" two matrices, they must have the same number 
of rows and the same number of columns, i.e., addition is a binary operation R m ,n x 
Rm, n —> Rm,n- The addition is defined by (aij) + {hj) = (a«j + bij), i.e., the i,j term 
of the sum is the sum of the i,j terms. The following theorem is just an observation. 



^■m.n 



Theorem R m ,n is an additive abelian group. Its "zero" is the matrix = r 
all of whose terms are zero. Also — (ojj) = (— cn,j)- Furthermore, as additive groups, 

D ~ r>mn 

1 ^m..rt. ~ ^t 



Scalar multiplication An element of R is called a scalar. A matrix may be 
"multiplied" on the right or left by a scalar. Right scalar multiplication is defined 
by (a,ij)c = (a,ij • c). It is a function R m , n xi? -> R m ,n- Note in particular that 
scalar multiplication is defined on R n . Of course, if R is commutative, there is no 
distinction between right and left scalar multiplication. 

Theorem Suppose A, B G R m ,n and c,d G R. Then 

(A + B)c =Ac + Bc 
A(c + d) = Ac + Ad 
A(cd) = (Ac)d 
and Al = A 

This theorem is entirely transparent. In the language of the next chapter, it merely 
states that R m n is a right module over the ring R. 



Multiplication of Matrices The matrix product AB is defined iff the number 
of columns of A is equal to the number of rows of B. The matrix AB will have the 
same number of rows as A and the same number of columns as B, i.e., multiplication 
is a function R m ,n x R n ,p —* R m , P - The product (ai,j)(&ij) is defined to be the matrix 
whose (s,t) term is a Sj i • &i jt + • • • + a s ^ n ■ b n>t , i.e., the dot product of row s of A 
with column t of B. 






Consider real matrices ^4= ^)'^ = n )'^ = 

Find the matrices AU, UA, AV, VA, AW, and WA. 
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Definition The identity matrix I n G R n is the square matrix whose diagonal terms 
are 1 and whose off-diagonal terms are 0. 



Theorem Suppose A G R m ,n 

*) i-m-A = A = j4i n 



-V iip,m^ iip,n ^Lin,p iim,p 



Theorem (The distributive laws) (A + B)C = AC + 5C and 

C(j4 + 5) = CA + C5 whenever the 
operations are defined. 

Theorem (The associative law for matrix multiplication) Suppose A G R m ,n, 
B G R n , p , and C G R p , q . Then (AB)C = A(BC). Note that ^5C G R m , q . 

Proof We must show that the (s,t) terms are equal. The proof involves writing 
it out and changing the order of summation. Let (xjj) = AB and (yij) = BC . 
Then the (s,t) term of (AB)C is ^x Syi c iyt = Xl(XXA*) c M = XX,A* C M = 

i i j i,j 

'^2, a s,jW2Pj,i c i,tj = '^2 a s,jyj,t which is the (s,t) term of A(BC). 



Theorem For each ring R and integer n > 1, i?„ is a ring. 

Proof This elegant little theorem is immediate from the theorems above. The 
units of R n are called invertible or non-singular matrices. They form a group under 
multiplication called the general linear group and denoted by GL n (R) = (R n )*. 

Exercise Recall that if A is a ring and a G A, then aA is right ideal of A. Let 
A = i?2 and o = (a^j) where ai ; i = 1 and the other entries are 0. Find aR<i and i?20- 
Show that the only ideal of i?2 containing a is i?2 itself. 



Multiplication by blocks Suppose A,Ee R n , B,F G R n ,m, C,G G R m ,n, and 
D,H £ R m . Then multiplication in R n + m is given by 

/ 4 b \( e f \ _ ( AE + BG AF + BH\ 
\ C D ) { G HI ~ \ CE + DG CF + DH j ' 
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Transpose 

Notation For the remainder of this chapter on matrices, suppose R is a commu- 
tative ring. Of course, for n > 1, R n is non-commutative. 

Transpose is a function from R m ,n to R n ,m- If A G R„ hn , A 1 G R n ,m is the matrix 
whose (i,j) term is the (j,i) term of A. So row i (column i) of A becomes column 
i (row i) of A 1 . If ^4 is an n- dimensional row vector, then A 1 is an n-dimensional 
column vector. If A is a square matrix, A 1 is also square. 

Theorem 1) (A*)* = A 

2) {A + BY = A* + B f 

3) If c G R, (Ac)' = A f c 

4) (ABY = B l A l 

5) If A G i? n , then A is invertible iff A* is invertible. 
In this case (.4" 1 )* = (A*)" 1 . 

Proof of 5) Suppose A is invertible. Then I = R = (AA~ l Y = (A'^A 1 . 

Exercise Characterize those invertible matrices A G R2 which have A~ l = A 1 . 
Show that they form a subgroup of GL2CR.). 

Triangular Matrices 



If A G R n , then A is upper (lower) triangular provided a,ij = for all i > j (all 
j > i). A is strictly upper (lower) triangular provided a^j = for all i > j (all j > i). 
A is diagonal if it is upper and lower triangular, i.e., a^j = for all i ^ j. Note 
that if A is upper (lower) triangular, then A 1 is lower (upper) triangular. 

Theorem If A G R n is strictly upper (or lower) triangular, then A n = 0. 

Proof The way to understand this is just multiply it out for n = 2 and n = 3. 
The geometry of this theorem will become transparent later in Chapter 5 when the 
matrix A defines an i?-module endomorphism on R n (see page 93). 

Definition If T is any ring, an element t G T is said to be nilpotent provided 3n 
such that t n = 0. In this case, (I — t) is a unit with inverse I + t + t 2 + • • • + t n ~ l . 
Thus if T = R n and B is a nilpotent matrix, I — B is invertible. 
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Exercise 



Let R = Z. Find the inverse of 




Exercise Suppose A 



a i 



0-2 







V 



is a diagonal matrix, B G R mn , 



a n J 



and C G R n , P - Show that BA is obtained from B by multiplying column i oi B 
by Oj. Show AC is obtained from C by multiplying row i of C by ttj. Show ^4 is a 
unit in i? n iff each a« is a unit in R. 



Scalar matrices A scalar matrix is a diagonal matrix for which all the diagonal 
terms are equal, i.e., a matrix of the form cl n . The map R — ► i? n which sends c to 
c/„ is an injective ring homomorphism, and thus we may consider R to be a subring 
of R n . Multiplying by a scalar is the same as multiplying by a scalar matrix, and 
thus scalar matrices commute with everything, i.e., if B G R n , (cI n )B = cB = Be = 
B(cl n ). Recall we are assuming R is a commutative ring. 

Exercise Suppose A G R n and for each B G R n , AB = BA. Show A is a scalar 
matrix. For n > 1, this shows how non- commutative i?„ is. 



Elementary Operations and Elementary Matrices 



Suppose R is a commutative ring and A is a matrix over i?. There are 3 types of 
elementary row and column operations on the matrix A. A need not be square. 



Type 1 Multiply row i by some 
unit a G R. 



Multiply column i by some 
unit a G R. 



Type 2 
Type 3 



Interchange row i and row j. Interchange column i and column j. 



Add a times row j 

to row i where i ^ j and a 

is any element of R. 



Add a times column i 

to column j where i ^ j and a 

is any element of R. 



5£ 
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Elementary Matrices Elementary matrices are square and invertible. There 
are three types. They are obtained by performing row or column operations on the 
identity matrix. 



Typel 



B 



/I \ 

1 

a 

1 

1 

V i / 



where a is a unit in R. 



Type 2 



B 



( 1 \ 

1 
1 

1 

1 
V 1 ) 



Type 3 



/ 1 



B 



\ 



'■1,3 



V 



1/ 



where i ^ j and ajj is 
any element of R. 



In type 1, all the off-diagonal elements are zero. In type 2, there are two non-zero 
off-diagonal elements. In type 3, there is at most one non-zero off-diagonal element, 
and it may be above or below the diagonal. 

Exercise Show that if B is an elementary matrix of type 1,2, or 3, then B is 
invertible and B~ x is an elementary matrix of the same type. 

The following theorem is handy when working with matrices. 



Theorem Suppose A is a matrix. It need not be square. To perform an elemen- 
tary row (column) operation on A, perform the operation on an identity matrix to 
obtain an elementary matrix B, and multiply on the left (right). That is, BA = row 
operation on A and AB = column operation on A. (See the exercise on page 54.) 
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Exercise 

1) 

2) 
3) 



4) 



Suppose F is a field and A G F m ^ n . 

Show 3 invertible matrices B G F m and C E F n such that BAC = (dij) 
where d\^ = • • • = d tt = 1 and all other entries are 0. The integer t is 
called the rank of A. (See page 89 of Chapter 5.) 

Suppose A G F n is invertible. Show A is the product of elementary 
matrices. 

A matrix T is said to be in row echelon form if, for each 1 < i < m, the 
first non-zero term of row (i + V) is to the right of the first non-zero 
term of row i. Show 3 an invertible matrix B G F m such that BA is in 
row echelon form. 



Let A 



Write A and D as products 



I 4 1 ) andD= (l 4, 

of elementary matrices over Q. Is it possible to write them as products 
of elementary matrices over Z? 



For 1), perform row and column operations on A to reach the desired form. This 
shows the matrices B and C may be selected as products of elementary matrices. 
Part 2) also follows from this procedure. For part 3), use only row operations. Notice 
that if T is in row-echelon form, the number of non-zero rows is the rank of T. 



Systems of Equations 



Suppose A = (Ojj) G Rm,n an d C 



( c l \ 



V c m / 



G R m = R ml . The system 



CL\ \X\ 



^l,n^n ~ ^1 



of m equations in n unknowns, can be written as one 



^m,l^l ~t~ ' " ' ~r Ciyyi n X n C Tl 



matrix equation in one unknown, namely as (o 



1,3 J 



/ Xi \ 



\ x ra / 



/ Cl \ 



\ Cm / 



or AX = C. 
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Define / : R n — ► R m by /(-D) = AD. Then / is a group homomorphism and also 
f(Dc) = f(D)c for any c G R. In the language of the next chapter, this says that 
/ is an i?-module homomorphism. The next theorem summarizes what we already 
know about solutions of linear equations in this setting. 

Theorem 

1) AX = is called the homogeneous equation. Its solution set is ker(/). 

2) AX = C has a solution iff C G image(/). If D G R n is one 
solution, the solution set f~ l (C) is the coset D + ker(/) in R n . 
(See part 7 of the theorem on homomorphisms in Chapter 2, page 28.) 

3) Suppose B G R m is invertible. Then AX = C and (BA)X = BC have 
the same set of solutions. Thus we may perform any row operation 
on both sides of the equation and not change the solution set. 

4) If m = n and A G R m is invertible, then AX = C has the unique 
solution X = A~ X C. 

The geometry of systems of equations over a field will not become really trans- 
parent until the development of linear algebra in Chapter 5. 

Determinants 



The concept of determinant is one of the most amazing in all of mathematics. 
The proper development of this concept requires a study of multilinear forms, which 
is given in Chapter 6. In this section we simply present the basic properties. 

For each n > 1 and each commutative ring R, determinant is a function from R n 

' a b 



to R. For n = 1, | (a) | = a. For n 



c d 



ad — be. 



Definition Let A = (ojj) G R n . If a is a permutation on {1, 2, ...,n}, let sign(a) = 
1 if a is an even permutation, and sign(cr) = —1 if a is an odd permutation. The 
determinant is defined by | A |= J^ sign (a) a liCr ( 1 ) • a 2)CT (2) ■ ■ ■ Un^in)- Check that for 

alio" 

n = 2, this agrees with the definition above. (Note that here we are writing the 
permutation functions as a(i) and not as (i)cr.) 
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For each a, cii,a(i) ' a 2,a(2) • • • a n,a{n) contains exactly one factor from each row and 
one factor from each column. Since R is commutative, we may rearrange the factors 
so that the first comes from the first column, the second from the second column, etc. 
This means that there is a permutation r on {1, 2, . . . , n} such that ai,<r(i) ■ ■ ■ <^n,a(n) = 
°t(i),i ' " ' a r{n),n- We wish to show that r = a~ l and thus sign(a) = sign(r). To 
reduce the abstraction, suppose a(2) = 5. Then the first expression will contain 
the factor 02,5. In the second expression, it will appear as a r (5),5) and so r(5) = 2. 
Anyway, r is the inverse of a and thus there are two ways to define determinant. It 
follows that the determinant of a matrix is equal to the determinant of its transpose. 

Theorem |A| = ^ si g n (°')oi,<7(i) ■ a 2,a(2) ■ ■ ■ a n ^ n) = ^ sign(r)o T (i) j i • o T ( 2 ),2 ■ ■ ■ ar(n),n- 

all a all r 



Corollary \A\ = \A l \. 

You may view an n x n matrix A as a sequence of n column vectors or as a 
sequence of n row vectors. Here we will use column vectors. This means we write the 
matrix A as A = (A l7 A 2 , . . . , A n ) where each A^ G i? n> i = R n . 

Theorem If two columns of A are equal, then \A\ — 0. 

Proof For simplicity, assume the first two columns are equal, i.e., A\ = A 2 . 
Now \A\ = J^ sign(r)o r (i) j i • a T (2),2 ■ ■ ■ a r(n),n and this summation has n\ terms and 

allr 

n\ is an even number. Let 7 be the transposition which interchanges one and two. 
Then for any r, a T (i),i ■ Or(2),2 ■ ■ ■ a-r(n),n = a r 7 (i), 1 ■ ar 7 (2),2 ■ ■ ■ a Ty ( n ) :n . This pairs up 
the n! terms of the summation, and since sign(r)=— sign(r7), these pairs cancel 
in the summation. Therefore |A| = 0. 

Theorem Suppose 1 < r < n, C r G R n ,i, and a,c G R. Then \(Ai, . . . , A r _i, 
aA r + cC r ,A r+1 ,...,A n )\ = a\(Ai, . . . , A n )\ + c\(Ai,...,A r _ 1 ,C r ,A r+ i,...,A n )\ 

Proof This is immediate from the definition of determinant and the distributive 
law of multiplication in the ring R. 

Summary Determinant is a function d : R n — ► R. In the language used in the 
Appendix, the two previous theorems say that d is an alternating multilinear form. 
The next two theorems show that alternating implies skew-symmetric (see page 129). 
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Theorem Interchanging two columns of A multiplies the determinant by minus 
one. 

Proof For simplicity, show that \(A 2 , Ai,A 3 , . . . ,A n )\ = —\A\. We know = 
|(Ai + A 2 ,A 1 + A 2 , A 3 , . . . , A n )\ = \(A 1 ,A 1 ,A 3 ,...,A n )\ + \(A U A 2 , A 3 , . . . , A n )\ + 
\(A 2 , A\, A 3 , . . . , A n )\ + 1(^2, A 2 , A3, . . . , A n )\. Since the first and last of these four 
terms are zero, the result follows. 

Theorem If r is a permutation of (1, 2, . . . , n), then 
\A\ = Bign(r)|(A r (i),A r ( 2 ),...,A r ( n ))|. 

Proof The permutation r is the finite product of transpositions. 

Exercise Rewrite the four preceding theorems using rows instead of columns. 

The following theorem is just a summary of some of the work done so far. 

Theorem Multiplying any row or column of matrix by a scalar c G R, multiplies 
the determinant by c. Interchanging two rows or two columns multiplies the determi- 
nant by —1. Adding c times one row to another row, or adding c times one column 
to another column, does not change the determinant. If a matrix has two rows equal 
or two columns equal, its determinant is zero. More generally, if one row is c times 
another row, or one column is c times another column, then the determinant is zero. 



There are 2n ways to compute | A |; expansion by any row or expansion by any 
column. Let M^ be the determinant of the (n — 1) x (n — 1) matrix obtained by 
removing row i and column j from A. Let Cij = (—l) l+: >Mij. Mij and Cij are 
called the (i,j) minor and cof actor of A. The following theorem is useful but the 
proof is a little tedious and should not be done as an exercise. 

Theorem For any 1 < i < n, | A |= a^iC^i + cii y2 Ci y2 + • • • + Oi in Cj jn . For any 
1 < J ' < n, \A\= aijCij + a 2 jC 2 j + • • • + a n jC n j. Thus if any row or any column is 
zero, the determinant is zero. 



Exercise Let A = \ bi bo b 3 \ . The determinant of A is the sum of six terms. 



0,1 


a 2 


0,3 


61 


b 2 


b 3 


C\ 


(■■2 


c 3 
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Write out the determinant of A expanding by the first column and also expanding by 
the second row. 

Theorem If A is an upper or lower triangular matrix, | A | is the product of the 
diagonal elements. If A is an elementary matrix of type 2, | A |= — 1. If A is an 
elementary matrix of type 3, | A |= 1. 

Proof We will prove the first statement for upper triangular matrices. If A G Ri 
is an upper triangular matrix, then its determinant is the product of the diagonal 
elements. Suppose n > 2 and the theorem is true for matrices in R n -\. Suppose 
A G R n is upper triangular. The result follows by expanding by the first column. 

An elementary matrix of type 3 is a special type of upper or lower triangular 
matrix, so its determinant is 1. An elementary matrix of type 2 is obtained from the 
identity matrix by interchanging two rows or columns, and thus has determinant —1. 

Theorem (Determinant by blocks) Suppose A G R n , B G R n ,mi and D G R m . 
Then the determinant of is | A \ \ D \ . 

Proof Expand by the first column and use induction on n. 

The following remarkable theorem takes some work to prove. We assume it here 
without proof. (For the proof, see page 130 of the Appendix.) 

Theorem The determinant of the product is the product of the determinants, 
i.e., if A,B G R n , \AB\ = \A\\B\. Thus \AB\ = \BA\ and if C is invertible, 

Corollary If A is a unit in R n , then \A\ is a unit in R and | A -1 1 = | A | _1 . 
Proof 1 = 1 1 1 = | AA- 1 1 = | A 1 1 A- 1 \ . 

One of the major goals of this chapter is to prove the converse of the preceding 
corollary. 



Classical adjoint Suppose R is a commutative ring and A G R n . The classical 
adjoint of A is (Cjj)', i.e., the matrix whose (j,i) term is the (i,j) cofactor. Before 
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we consider the general case, let's examine 2x2 matrices. 

If A- ( I \ ) then (C,.,) - ( _ d b ~l ) and SO (C,,,)' - ( _^ "» ). Then 

A(C itj y = (CijfA = ( J I 4 I ) = ' ^ ' 7 ' Thus if I A I is a unit in i? ' ^ is 

invertible and A~ l = \ A \~ l (Cjj)*. In particular, if | A \ — 1, A -1 = I 
Here is the general case. 

Theorem If R is commutative and A G R n , then A(Cjj)* = (C^YA = \A\ I. 

Proof We must show that the diagonal elements of the product A(C.ijY are an 
| A | and the other elements are 0. The (s, s) term is the dot product of row s of A 
with row s of (Cjj) and is thus | ^4 | (computed by expansion by row s). For s ^ t, 
the (s, t) term is the dot product of row s of A with row t of (Cij). Since this is the 
determinant of a matrix with row s = row t, the (s,t) term is 0. The proof that 
{CijYA — \A\I is similar and is left as an exercise. 

We are now ready for one of the most beautiful and useful theorems in all of 
mathematics. 



Theorem Suppose R is a commutative ring and A G R n . Then A is a unit in 
R n iff | A | is a unit in R. (Thus if R is a field, A is invertible iff | A \ ^ 0.) If A is 

invertible, then A' 1 = \A\ _1 {C id Y- Thus if \A\ =1, A' 1 = {C itJ Y, the classical 
adjoint of A. 

Proof This follows immediately from the preceding theorem. 

Exercise Show that any right inverse of A is also a left inverse. That is, suppose 
A,B G R n and AB = I. Show A is invertible with A' 1 = B, and thus BA = I. 

Similarity 



Suppose A, B G R n . B is said to be similar to A if 3 an invertible C G R n such 
that B = C~ l AC, i.e., B is similar to A iff B is a conjugate of A. 

Theorem £> is similar to B. 
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B is similar to A iff A is similar to B. 

If D is similar to B and B is similar to A, then I? is similar to A. 

"Similarity" is an equivalence relation on R n . 

Proof This is a good exercise using the definition. 

Theorem Suppose A and B are similar. Then \A\ = \B\ and thus A is invertible 
iff B is invertible. 

Proof Suppose B = C~ X AC. Then \B\ = | C'UC | = \ACC~ X \ = \A\. 



Trace Suppose A = (a^j) G i? n . Then the trace is defined by trace(A) = ai ; i + 
«2,2 + • • • + a n ,n- That is, the trace of A is the sum of its diagonal terms. 

One of the most useful properties of trace is trsice(AB) = trace(-BA) whenever AB 
and BA are defined. For example, suppose A = (ai, 02, ..., a n ) and 5 = (61, 6 2 , ..., 6 n )*. 
Then AS is the scalar ai&i + • • • + a n 6 n while 5 A is the n x n matrix (bi(ij). Note 
that tr&ce(AB) = trace(-BA). Here is the theorem in full generality. 

Theorem Suppose A G R m ,n and B G R n ,m- Then AB and -BA are square 
matrices with trace(Af?) = trace(BA). 

Proof This proof involves a change in the order of summation. By definition, 
trace (AB ) = ^ Oi,i&i,iH ho iin o nii = ^ a^b u = ^ &j,i«ij-l h6j, m o mjj = 



trace(-BA). 



l<j<m l<»<m l<i<" 

l<j<n 



Theorem If A, B G i?„, trace(A + B) = trace(A) + trace(i3) and 
trace(A-B) = trace(-BA). 

Proof The first part of the theorem is immediate, and the second part is a special 
case of the previous theorem. 

Theorem If A and B are similar, then trace(A) = trace(-B). 
Proof trace(5) = trace(C _1 i4C) = trace(AC , C" 1 ) = trace(A). 
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Summary Determinant and trace are functions from R n to R. Determinant is a 
multiplicative homomorphism and trace is an additive homomorphism. Furthermore 
| AB | = | BA | and tra.ce(AB) = trace(i3A). If A and B are similar, \A\ = \B\ and 
trace(A) = trace(-B). 

Exercise Suppose A G R n and a G R. Find \aA\ and trace(aA). 



Characteristic polynomials If A G R n , the characteristic polynomial CPa(x) G 
i?[x] is defined by CPa{x) = \ (xl — A) |. Any A G R which is a root of CPa(x) is 
called a characteristic root of A. 

Theorem CPa(o;) = oo + aix + • • • + a n -\x n ~ 1 + x n where trace(^4) = — a n _i 
and |A| = (-l) n o . 

Proof This follows from a direct computation of the determinant. 

Theorem If A and B are similar, then they have the same characteristic polyno- 
mials. 

Proof Suppose B = C~ l AC '. CP B (x) = \(xl- C^AC) | = | C" 1 ^/ - A)C \ = 
\{xI-A)\ = CP A {x). 

Exercise Suppose R is a commutative ring, A = I J is a matrix in R 2 , and 

CPa{x) = Oo + a,\X + x 2 . Find a and a,\ and show that a$I + ai^4 + yl 2 = 0, i.e., 
show A satisfies its characteristic polynomial. In other words, CPa{A) = 0. 

Exercise Suppose F is a field and A G Fi- Show the following are equivalent. 

1) A 2 = 0. 

2) | A |= trace(A) = 0. 

3) CP A (z) = x 2 . 

4) 3 an elementary matrix C such that C~ x AC is strictly upper triangular. 



Note This exercise is a special case of a more general theorem. A square matrix 
over a field is nilpotent iff all its characteristic roots are iff it is similar to a strictly 
upper triangular matrix. This remarkable result cannot be proved by matrix theory 
alone, but depends on linear algebra (see pages 93, 94, and 98). 



Chapter 5 

Linear Algebra 



The exalted position held by linear algebra is based upon the subject's ubiquitous 
utility and ease of application. The basic theory is developed here in full generality, 
i.e., modules are defined over an arbitrary ring R and not just over a field. The 
elementary facts about cosets, quotients, and homomorphisms follow the same pat- 
tern as in the chapters on groups and rings. We give a simple proof that if R is a 
commutative ring and / : R n — ► R n is a surjective 7?-module homomorphism, then 
/ is an isomorphism. This shows that finitely generated free i?-modules have a well 
defined dimension, and simplifies some of the development of linear algebra. It is in 
this chapter that the concepts about functions, solutions of equations, matrices, and 
generating sets come together in one unified theory. 

After the general theory, we restrict our attention to vector spaces, i.e., modules 
over a field. The key theorem is that any vector space V has a free basis, and thus 
if V is finitely generated, it has a well defined dimension, and incredible as it may 
seem, this single integer determines V up to isomorphism. Also any endomorphism 
/ : V —y V may be represented by a matrix, and any change of basis corresponds to 
conjugation of that matrix. One of the goals in linear algebra is to select a basis so 
that the matrix representing / has a simple form. For example, if / is not injective, 
then / may be represented by a matrix whose first column is zero. As another 
example, if / is nilpotent, then / may be represented by a strictly upper triangular 
matrix. The theorem on Jordan canonical form is not proved in this chapter, and 
should not be considered part of this chapter. It is stated here in full generality only 
for reference and completeness. The proof is given in the Appendix. This chapter 
concludes with the study of real inner product spaces, and with the beautiful theory 
relating orthogonal matrices and symmetric matrices. 
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Definition Suppose R is a ring and M is an additive abelian group. The state- 
ment that M is a right R-rnodule means there is a scalar multiplication 

M x R —y M satisfying (oi + ci2)r = a\r + a 2 r 

(m,r) —* mr a(?"i + r 2 ) = ox\ + ar 2 

a(n ■ r 2 ) = (ori)r 2 

a\ = a 

for all o, ai, ci2 £ M and r, ri, r 2 G i?. 

The statement that M is a /e/t R-module means there is a scalar multiplication 

R x M —y M satisfying r(oi + o 2 ) = ra\ + ra2 

(r,m) —y rm (r x + r 2 )a = r-^a + r 2 a 

(n -r 2 )o = ri(r 2 a) 

la = a 

Note that the plus sign is used ambiguously, as addition in M and as addition in R. 



Notation The fact that M is a right (left) i?-module will be denoted by M = Mr 
(M = rM). If R is commutative and M = Mr then left scalar multiplication defined 
by ra = ar makes M into a left i?-module. Thus for commutative rings, we may write 
the scalars on either side. In this text we stick to right i?-modules. 

Convention Unless otherwise stated, it is assumed that R is a ring and the word 
"R- module" (or sometimes just "module") means "right i?-module". 

Theorem Suppose M is an i?-module. 

1) If r G R, then / : M —y M defined by f(a) = or is a homomorphism of 
additive groups. In particular (Q M )r = Q M . 

2) If ae M, aO R = M . 

3) If a G M and r G R, then (—a)r = —(ar) = a(—r). 

Proof This is a good exercise in using the axioms for an i?-module. 
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Submodules If M is an i?-module, the statement that a subset iV C M is a 
submodule means it is a subgroup which is closed under scalar multiplication, i.e., if 
a £ N and r G R, then ar G AT. In this case A" will be an i?-module because the 
axioms will automatically be satisfied. Note that and M are submodules, called the 
improper submodules of M. 

Theorem Suppose M is an i?-module, T is an index set, and for each t G T, 
N t is a submodule of M. 

1) (~]N t * s a submodule of M. 

2) If {N t } is a monotonic collection, [J N t is a submodule. 

3) +t^rNt = {all finite sums oi + • ■ +o m : each a; belongs 
to some N t } is a submodule. If T = {1,2,.., n}, 
then this submodule may be written as 

N\ + N 2 + • • +Af n = {oi + a 2 + • • +a n : each Oj G A^}. 

Proof We know from page 22 that versions of 1) and 2) hold for subgroups, and 
in particular for subgroups of additive abelian groups. To finish the proofs it is only 
necessary to check scalar multiplication, which is immediate. Also the proof of 3) is 
immediate. Note that if Aq and N 2 are submodules of M, Ni + N 2 is the smallest 
submodule of M containing N\ U N 2 . 

Exercise Suppose T is a non-void set, A^ is an i?-module, and A^ T is the collection 
of all functions / : T — ► A" with addition defined by (f + g)(t) = f(t)+g(t), and scalar 
multiplication defined by (fr)(t) = f(t)r. Show A^ T is an R- module. (We know from 
the last exercise in Chapter 2 that A^ T is a group, and so it is only necessary to check 
scalar multiplication.) This simple fact is quite useful in linear algebra. For example, 
in 5) of the theorem below, it is stated that Hom#(M, N) forms an abelian group. 
So it is only necessary to show that Hom#(M, N) is a subgroup of N M . Also in 8) it 
is only necessary to show that Hom#(M, N) is a submodule of N M . 

Homomorphisms 



Suppose M and A^ are i?-modules. A function / : M — ► A^ is a homomorphism 
(i.e., an R- module homomorphism) provided it is a group homomorphism and if 
a G M and r G R, /(or) = f(a)r. On the left, scalar multiplication is in M and on 
the right it is in N. The basic facts about homomorphisms are listed below. Much 
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of this work has already been done in the chapter on groups (see page 28). 



Theorem 



1) The zero map M — ► A" is a homomorphism. 

2) The identity map / : M — ► M is a homomorphism. 

3) The composition of homomorphisms is a homomorphism. 

4) The sum of homomorphisms is a homomorphism. If f,g : M —>■ N are 
homomorphisms, define (/ + g) : M —>■ N by (/ + g)(a) = f(a) + g{o). 
Then / + g is a homomorphism. Also (— /) defined by (— /)(o) = —f{a) 
is a homomorphism. If ft, : A" — ► P is a homomorphism, 

h ° (/ + g) = (ft ° /) + (ft o g). If k : P — > M is a homomorphism, 
{f + g)ok=(fok) + (gok). 

5) Hom#(M, JV) = Hom(AfR, A/#), the set of all homomorphisms from M 
to iV, forms an abelian group under addition. Hom^(M, M), with 
multiplication defined to be composition, is a ring. 



6) 



7) 

8) 
9) 



If a bijection / : M — ► A^ is a homomorphism, then / _1 : A^ —>■ M is also 
a homomorphism. In this case / and f~ l are called isomorphisms. A 
homomorphism / : M — ► M is called an endomorphism. An isomorphism 
/ : M — >• M is called an automorphism. The units of the endomorphism 
ring Hom^(M , M) are the automorphisms. Thus the automorphisms on 
M form a group under composition. We will see later that if M = R n , 
Homfi(P n ,P n ) is just the matrix ring R n and the automorphisms 
are merely the invertible matrices. 

If R is commutative and r G R, then g : M — ► M defined by 5(0) = ar 
is a homomorphism. Furthermore, if / : M —>■ N is a homomorphism, 
/r defined by (fr)(a) = /(or) = f(a)r is a homomorphism. 

If P is commutative, Hom#(M , N) is an P-module. 

Suppose / : M —>■ N is a homomorphism, G C M is a submodule, 
and H C A^ is a submodule. Then /(G) is a submodule of A" 
and / _1 (P^) is a submodule of M. In particular, image(/) is a 
submodule of A" and ker(/) = / _1 (0) is a submodule of M. 



Proof This is just a series of observations. 
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Abelian groups are Z-modules On page 21, it is shown that any additive 
group M admits a scalar multiplication by integers, and if M is abelian, the properties 
are satisfied to make M a Z-module. Note that this is the only way M can be a Z- 
module, because ol = a, a2 = a + a, etc. Furthermore, if / : M — ► A" is a group 
homomorphism of abelian groups, then / is also a Z-module homomorphism. 

Summary Additive abelian groups are "the same things" as Z-modules. While 
group theory in general is quite separate from linear algebra, the study of additive 
abelian groups is a special case of the study of i?-modules. 

Exercise i?-modules are also Z-modules and i?-module homomorphisms are also 
Z-module homomorphisms. If M and A" are Q-modules and / : M — ► A" is a 
Z-module homomorphism, must it also be a Q-module homomorphism? 

Homomorphisms on R n 



R n as an f?-module On page 54 it was shown that the additive abelian 
group R m , n admits a scalar multiplication by elements in R. The properties listed 
there were exactly those needed to make R m ,n an i?-module. Of particular importance 
is the case R n = R © • • (BR = R n ,i (see page 53). We begin with the case n = 1. 

R as a right i?-module Let M = R and define scalar multiplication on the right 
by or = a ■ r. That is, scalar multiplication is just ring multiplication. This makes 
R a right R- module denoted by Rr (or just R). This is the same as the definition 
before for R n when n = 1. 

Theorem Suppose R is a ring and A^ is a subset of R. Then A^ is a submodule 
of R R ( R R) iff N is a right (left) ideal of R. 

Proof The definitions are the same except expressed in different language. 

Theorem Suppose M = Mr and f,g : R — ► M are homomorphisms with /(l) = 
g(l)- Then / = g. Furthermore, if m G M, 3! homomorphism h : R — ► M with 
h(l) = m. In other words, HomR(i?, M) pa M. 

Proof Suppose /(l) = g(l). Then /(r) = /(l • r) = /(l)r = ^(^r = o(l • r) = 
g(r). Given m G M, h : R —> M defined by h(r) = mr is a homomorphism. Thus 



72 



Linear Algebra Chapter 5 



evaluation at 1 gives a bijection from Hom#(i?, M) to M, and this bijection is clearly 
a group isomorphism. If R is commutative, it is an isomorphism of i?-modules. 

In the case M = R, the above theorem states that multiplication on left by some 
m G R defines a right i?-module homomorphism from R to R, and every module 
homomorphism is of this form. The element m should be thought of as a 1 x 1 
matrix. We now consider the case where the domain is R n . 



Homomorphisms on R n Define e^ G R n by e-i 



/Q \ 



l, 



/n \ 



. Note that any 



V Q / \ T n J 

can be written uniquely as e\T\ + • • +e n r n . The sequence {e l7 ..,e n } is called the 
canonical free basis or standard basis for R n . 



Theorem Suppose M = Mr and /, g : R n — ► M are homomorphisms with 
f{ e i) — g{ e i) for 1 < i < n. Then / = g. Furthermore, if mi,m2, ■■■,Tn n G M, 3! 
homomorphism h : R n — > M with /i(ej) = m, for 1 < i < m. The homomorphism 



h is defined by h{e\ri 



Cn.i r. 



m 1 r 1 -\ Ym n r n . 



Proof The proof is straightforward. Note this theorem gives a bijection from 
Hom#(i? n , M) to M n = M x M x • • xM and this bijection is a group isomorphism. 
We will see later that the product M n is an i?-module with scalar multiplication 
defined by (mi,m2, ..,m n )r = (mir, m2r, ..,m n r). If R is commutative so that 
Hom^(i? n , M) is an f?-module, this theorem gives an i?-module isomorphism from 
Rom R (R n , M) to M n . 

This theorem reveals some of the great simplicity of linear algebra. It does not 
matter how complicated the ring R is, or which i?-module M is selected. Any 
-R-module homomorphism from R n to M is determined by its values on the basis, 
and any function from that basis to M extends uniquely to a homomorphism from 
R n to M. 



Exercise Suppose R is a field and / 
Show / is injective. 



Rr —y M is a non-zero homomorphism. 
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Now let's examine the special case M = R m and show Hom^i?™, R m ) ta R ri 

Theorem Suppose A = (a i:j ) G R m ,n- Then / : R n -> R m denned by f(B) = AB 
is a homomorphism with /(e«) = column i of A. Conversely, if v\, . . . , v n G R m , define 
A G R m , n t° be the matrix with column i = v^. Then / defined by f(B) = AB is 
the unique homomorphism from R n to R m with /(ej) = i> ;. 

Even though this follows easily from the previous theorem and properties of ma- 
trices, it is one of the great classical facts of linear algebra. Matrices over R give 
i?-module homomorphisms! Furthermore, addition of matrices corresponds to addi- 
tion of homomorphisms, and multiplication of matrices corresponds to composition 
of homomorphisms. These properties are made explicit in the next two theorems. 

Theorem If /*, c/ : R n — > R m are given by matrices A,C G R m ,nj then / + g is 
given by the matrix A + C. Thus Hom#(i? n , R m ) and R m ,n are isomorphic as additive 
groups. If R is commutative, they are isomorphic as i?-modules. 

Theorem If / : R n — > R m is the homomorphism given by A G R m ,n and g : 
R m —y R p is the homomorphism given by C G R p , m , then g o f : R n — > R p is given by 
C A G R Py n- That is, composition of homomorphisms corresponds to multiplication 
of matrices. 

Proof This is just the associative law of matrix multiplication, C(AB) = (CA)B. 

The previous theorem reveals where matrix multiplication comes from. It is the 
matrix which represents the composition of the functions. In the case where the 
domain and range are the same, we have the following elegant corollary. 

Corollary Hom^(i? n , R n ) and R n are isomorphic as rings. The automorphisms 
correspond to the invertible matrices. 

This corollary shows one way non- commutative rings arise, namely as endomor- 
phism rings. Even if R is commutative, R n is never commutative unless n = 1. 

We now return to the general theory of modules (over some given ring R). 
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Cosets and Quotient Modules 

After seeing quotient groups and quotient rings, quotient modules go through 
without a hitch. As before, R is a ring and module means i?-module. 

Theorem Suppose M is a module and JV C Af is a submodule. Since A is a 
normal subgroup of M, the additive abelian quotient group M/N is defined. Scalar 
multiplication defined by (a + N)r = (ar + A) is well-defined and gives M/N the 
structure of an i?-module. The natural projection tx : M — ► M/N is a surjective 
homomorphism with kernel N. Furthermore, if / : M — ► M is a surjective homomor- 
phism with ker(/) = A, then M/N « M (see below). 

Proof On the group level, this is all known from Chapter 2 (see pages 27 and 29). 
It is only necessary to check the scalar multiplication, which is obvious. 



The relationship between quotients and homomorphisms for modules is the same 
as for groups and rings, as shown by the next theorem. 

Theorem Suppose / : M — ► M is a homomorphism and A is a submodule of M . 
If A c ker(/), then / : (M/N) -► M defined by /(a + A) = f(a) is a well-defined 
homomorphism making the following diagram commute. 

/ 

M M 



TV 



f 

M/N 

Thus defining a homomorphism on a quotient module is the same as defining a homo- 
morphism on the numerator that sends the denominator to 0. The image of / is the 
image of /, and the kernel of / is ker(f)/A. Thus if A = ker( /), / is injective, and 
thus (M/N) pa image(f). Therefore for any homomorphism /, (domain(/)/ker(/)) ~ 
mrage(f). 

Proof On the group level this is all known from Chapter 2 (see page 29). It is 
only necessary to check that / is a module homomorphism, and this is immediate. 
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Theorem Suppose M is an i?-module and K and L are submodules of M. 

i) The natural homomorphism K — > (if + L)/L is surjective with kernel 
K C\L. Thus (K/K C\ L) ^ (K + L)/L is an isomorphism. 

ii) Suppose K C L. The natural homomorphism M/K — ► M/L is surjective 
with kernel L/if. Thus (M/K)/(L/K) -^ M/L is an isomorphism. 

Examples These two examples are for the case R = Z, i.e., for abelian groups. 

1) M = Z K = 3Z L = 5Z LfnL = 15Z K + L = Z 
K/K C\L = 3Z/15Z » Z/5Z = (K + L)/L 

2) M = Z X = 6Z L = 3Z (K C L) 

(M/K)/(L/K) = (Z/6Z)/(3Z/6Z) w Z/3Z = M/L 

Products and Coproducts 



Infinite products work fine for modules, just as they do for groups and rings. 
This is stated below in full generality, although the student should think of the finite 
case. In the finite case something important holds for modules that does not hold 
for non-abelian groups or rings - namely, the finite product is also a coproduct. This 
makes the structure of module homomorphisms much more simple. For the finite 
case we may use either the product or sum notation, i.e., M 1 x M 2 x • • xM n = 
M l ®M 2 ®-- ®M n 



1 n ■ 



Theorem Suppose T is an index set and for each t G T, M t is an i?-module. On 
the additive abelian group J\ M t = n M t define scalar multiplication by {rrit}r = 

{mtr}. Then Y\M t is an i?-module and, for each s G T, the natural projection 
7T S : n Mt —y M s is a homomorphism. Suppose M is a module. Under the natural 1-1 
correspondence from {functions / : M — ► []M} to {sequence of functions {ft}ter 
where f t :M—y M t }, f is a homomorphism iff each f t is a homomorphism. 

Proof We already know from Chapter 2 that / is a group homomorphism iff each 
ft is a group homomorphism. Since scalar multiplication is defined coordinatewise, 
/ is a module homomorphism iff each f t is a module homomorphism. 
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Definition If T is finite, the coproduct and product are the same module. If T 
is infinite, the coproduct or sum TT M t = @ M t = ©M t is the submodule of n M t 

consisting of all sequences {m t } with only a finite number of non-zero terms. For 
each s E T, the inclusion homomorphisms i s : M s —y ©M t is defined by i s (a) = {a t } 
where a t = if t ^ s and a s = a. Thus each M s may be considered to be a submodule 

of eMj. 

Theorem Suppose M is an i?-module. There is a 1-1 correspondence from 

{homomorphisms g : ®M t —y M} and {sequences of homomorphisms {gt}t£T where 

g t : M t —y M} . Given g, g t is defined by g t = g o i t . Given {gt}, g is defined by 

g({mt}) = ^^gti^nit). Since there are only a finite number of non-zero terms, this 

t 
sum is well defined. 

For T = {1,2} the product and sum properties are displayed in the following 
commutative diagrams. 



M 




Mi 



"i 



M 






Mi M 2 M 2 Mi ^+ Mi M 2 - — : — M 2 

7T2 " %\ " %2 



Theorem For finite T, the 1-1 correspondences in the above theorems actually 
produce group isomorphisms. If R is commutative, they give isomorphisms of R- 
modules. 



Hom R (M,Mi 
Hom i? (M 1 • 



®M n ,M) 



Hom i? (M,Mi) 
Hom R (Mi,M) 



)Hom R (M, M n ) 

)Hom H (M n ,M) 



and 



Proof Let's look at this theorem for products with n = 2. All it says is that if / = 
(/i> f2) an d /i = (hi, h 2 ), then / + h = (fi + hi, f 2 + h 2 ). If R is commutative, so that 
the objects are i?-modules and not merely additive groups, then the isomorphisms 
are module isomorphisms. This says merely that fr = (fi, f 2 )r = (fir, f 2 r). 
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Exercise Suppose M and iV are i?-modules. Show that M (B N is isomorphic to 
N ® M. Now suppose A C M, B C N are submodules and show (M ®N)/(A® B) 
is isomorphic to (M/A) © (N/B). In particular, if a G i? and b E R, then 
(i? © R)/(aR © 6i?) is isomorphic to (R/aR) © (R/bR). For example, the abelian 
group (Z © Z)/(2Z © 3Z) is isomorphic to Z 2 © Z 3 . These isomorphisms are trans- 
parent and are used routinely in algebra without comment (see Th 4, page 118). 

Exercise Suppose R is a commutative ring, M is an i?-module, and n > 1. Define 
a function a : Hom#(i? n ,M) — ► M n which is a i?-module isomorphism. 

Summands 



One basic question in algebra is "When does a module split as the sum of two 
modules?" . Before defining summand, here are two theorems for background. 

Theorem Consider Mi = Mi©0 as a submodule of MiQ)M 2 . Then the projection 
map 7T2 : Mi © M 2 — > M 2 is a surjective homomorphism with kernel Mi. Thus 
(Mi © M 2 )/Mi is isomorphic to M 2 . (See page 35 for the group version.) 

This is exactly what you would expect, and the next theorem is almost as intuitive. 

Theorem Suppose K and L are submodules of M and / : K © L — ► M is the 
natural homomorphism, f(k,l) = k + 1. Then the image of / is K + L and the 
kernel of / is {(a, —a) : a G K n L}. Thus / is an isomorphism iff K + L = M and 
ifnL = 0. In this case we write X © L = M. This abuse of notation allows us to 
avoid talking about "internal" and "external" direct sums. 

Definition Suppose K is a submodule of M. The statement that K is a summand 
of M means 3 a submodule L of M with X © L = M. According to the previous 
theorem, this is the same as there exists a submodule L with K + L = M and 
X n L = 0. If such an L exists, it need not be unique, but it will be unique up to 
isomorphism, because L ps M/K. Of course, M and are always summands of M. 

Exercise Suppose M is a module and K = {(m,m) : m G M} C M © M. Show 
if is a submodule of M © M which is a summand. 

Exercise R is a module over Q, and Q C R is a submodule. Is Q a summand of 
R? With the material at hand, this is not an easy question. Later on, it will be easy. 
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Exercise Answer the following questions about abelian groups, i.e., Z- modules. 
(See the third exercise on page 35.) 

1) Is 2Z a summand of Z? 

2) Is 2Z 4 a summand of Z 4 ? 

3) Is 3Zi2 a summand of Z12? 

4) Suppose m, n > 1. When is nZ mn a summand of Z r -- ? 



J »rm ■ 



Exercise If T is a ring, define the center of T to be the subring {t : ts = 
st for all s G T}. Let i? be a commutative ring and T = R n . There is a exercise 
on page 57 to show that the center of T is the subring of scalar matrices. Show R n 
is a left T- module and find Hom T (i? n , R n ). 

Independence, Generating Sets, and Free Basis 



This section is a generalization and abstraction of the brief section Homomor- 
phisms on R n . These concepts work fine for an infinite index set T because linear 
combination means finite linear combination. However, to avoid dizziness, the student 
should first consider the case where T is finite. 

Definition Suppose M is an i?-module, T is an index set, and for each t G T, 
s t G M. Let S be the sequence {s t } tgT = {s t }. The statement that S is dependent 
means 3 a finite number of distinct elements t\, ...,t n i n T, and elements r l7 ..,r n in 
R, not all zero, such that the linear combination s tl ri + • • +s tn r n = 0. Otherwise, 
S is independent. Note that if some s t = 0, then S is dependent. Also if 3 distinct 
elements t\ and £2 in T with s tl = s t2 , then S is dependent. 

Let SR be the set of all linear combinations s^ri + • • +St n r n . SR is a submodule 
of M called the submodule generated by S. If 5 is independent and generates M, 
then S is said to be a fraszs or /ree basis for M. In this case any v G M can be written 
uniquely as a linear combination of elements in S. An i?-module M is said to be a 
free i?-module if it is zero or if it has a basis. The next two theorems are obvious, 
except for the confusing notation. You might try first the case T = {1,2, ...,n} and 
®Rt = R n (see p 72). 

Theorem For each t G T, let Rt = Rr and for each c G T, let e c G (BRt = (J) Rt 

teT 
be e c = {r t } where r c = 1 and r 4 = if t 7^ c. Then {e c } cS T is a basis for ©i? t called 
the canonical basis or standard basis. 
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Theorem Suppose iV is an i?-module and M is a free i?-module with a basis 
{st}- Then 3 a 1-1 correspondence from the set of all functions g: {st} — ► N and the 
set of all homomorphisms / : M — > N. Given g, define / by /(s^ri + • • +s tn r n ) = 
g(s tl )ri + • • +g(s tn )r n . Given /, define g by g(s t ) = /(«<)■ In other words, / is 
completely determined by what it does on the basis S, and you are "free" to send the 
basis any place you wish and extend to a homomorphism. 

Recall that we have already had the preceding theorem in the case S is the canon- 
ical basis for M = R n (p 72). The next theorem is so basic in linear algebra that it 
is used without comment. Although the proof is easy, it should be worked carefully. 

Theorem Suppose N is a module, M is a free module with basis S = {s t }, and 
/ : M —y N is a homomorphism. Let f(S) be the sequence {/(«*)} in N. 

1) f(S) generates N iff / is surjective. 

2) f(S) is independent in N iff / is injective. 

3) f(S) is a basis for N iff / is an isomorphism. 

4) If h : M —y N is a homomorphism, then f = h iff f \ S = h\ S. 

Exercise Let (A 1 ,..,A n ) be a sequence of n vectors with each A^ G Z n . 

Show this sequence is linearly independent over Z iff it is linearly independent over Q. 
Is it true the sequence is linearly independent over Z iff it is linearly independent 
over R? This question is difficult until we learn more linear algebra. 

Characterization of Free Modules 



Any free R- module is isomorphic to one of the canonical free R- modules (BRt- 
This is just an observation, but it is a central fact in linear algebra. 

Theorem A non-zero i?-module M is free iff 3 an index set T such that 

M pa (J) R t . In particular, M has a finite free basis of n elements iff M th R n . 



Proof If M is isomorphic to (BRt then M is certainly free. So now suppose M 
has a free basis {st}- Then the homomorphism / : M — y (BRt with f(st) = e t sends 
the basis for M to the canonical basis for (BRt- By 3) in the preceding theorem, / is 
an isomorphism. 



80 Linear Algebra Chapter 5 



Exercise Suppose R is a commutative ring, A G R n , and the homomorphism 
/ : R n — ► i? n denned by /(.B) = A-B is surjective. Show / is an isomorphism, i.e., 
show A is invertible. This is a key theorem in linear algebra, although it is usually 
stated only for the case where R is a field. Use the fact that {ei, .., e n } is a free basis 
for R n . 

The next exercise is routine, but still informative. 

Exercise Let R = Z, A = i and /: Z 3 — ► Z 2 be the group homo- 

morphism defined by A. Find a non-trivial linear combination of the columns of A 
which is 0. Also find a non-zero element of kernel (/). 

If R is a commutative ring, you can relate properties of R as an i?-module to 
properties of R as a ring. 

Exercise Suppose R is a commutative ring and v G R, v ^ 0. 

1) v is independent iff v is 

2) v is a basis for i? iff v generates R iff v is 



Note that 2) here is essentially the first exercise for the case n = 1. That is, if 
f : R ^ R is a surjective i?-module homomorphism, then / is an isomorphism. 



Relating these concepts to matrices 

The theorem stated below gives a summary of results we have already had. It 
shows that certain concepts about matrices, linear independence, injective homo- 
morphisms, and solutions of equations, are all the same — they are merely stated in 
different language. Suppose A G R m ,n and / : R n — > R m is the homomorphism associ- 
ated with A, i.e., f(B) = AB. Let Vi, .., v n G R m be the columns of A, i.e., /(e^) = i>j 

h \ ( ci 

column i of A. Let B = j . ] represent an element of i?" and C 
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represent an element of R m . 
Theorem 

1) The element f(B) is a linear combination of the columns of A, that is 
f(B) = /(ei&i -\ \-e n K) = Vih -\ Yv n b n . Thus the image of / 

is generated by the columns of A. (See bottom of page 89.) 

2) {vi, ..,v n } generates R m iff / is surjective iff (for any C G R m , AX = C 
has a solution). 

3) {vi, .., v n } is independent iff / is injective iff AX = has a unique 
solution iff (3 C G R m such that AX = C has a unique solution). 

4) {vi, .., v n } is a basis for R m iff / is an isomorphism iff (for any C G R m , 
AX = C has a unique solution). 

Relating these concepts to square matrices 

We now look at the preceding theorem in the special case where n = m and R 
is a commutative ring. So far in this chapter we have just been cataloging. Now we 
prove something more substantial, namely that if / : R n — ► R n is surjective, then / 
is injective. Later on we will prove that if R is a field, injective implies surjective. 

Theorem Suppose R is a commutative ring, A G R n , and / : R n — > R n is defined 
by f(B) = AB. Let w l7 ..,v n G R n be the columns of A, and Wi, ..,w n G R n = i?i ; „ 
be the rows of A. Then the following are equivalent. 

1) / is an automorphism. 

2) A is invertible, i.e., | A | is a unit in R. 

3) {vi, .., v n } is a basis for R n . 

4) {vi, ..,v n } generates R n . 

5) / is surjective. 

2*) A 1 is invertible, i.e., | A 1 \ is a unit in R. 
3*) {wi, ..,w n } is a basis for R n . 
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4*) {u>i, ..,w n } generates R n . 

Proof Suppose 5) is true and show 2). Since / is onto, 3 Ui,...,u n G R n with 
f(ui) = ej. Let g : R n — > R n be the homomorphism satisfying g{ti) = U{. Then fog 
is the identity. Now g comes from some matrix D and thus AD = I. This shows that 
A has a right inverse and is thus invertible. Recall that the proof of this fact uses 
determinant, which requires that R be commutative (see the exercise on page 64). 

We already know the first three properties are equivalent, 4) and 5) are equivalent, 
and 3) implies 4). Thus the first five are equivalent. Furthermore, applying this 
result to A 1 shows that the last three properties are equivalent to each other. Since 
| A | = | A f |, 2) and 2*) are equivalent. 



Uniqueness of Dimension 



There exists a ring R with R 2 « R 3 as i?-modules, but this is of little interest. If 
R is commutative, this is impossible, as shown below. First we make a convention. 

Convention For the remainder of this chapter, R will be a commutative ring. 

Theorem If / : R m — > R n is a surjective i?-module homomorphism, then m > n. 

Proof Suppose k = n — m is positive. Define h : (R m © R k = R n ) — ► R n by 
h(u,v) = f(u). Then h is a surjective homomorphism, and by the previous section, 
also injective. This is a contradiction and thus m > n. 

Corollary If / : R m — > R n is an isomorphism, then m = n. 

Proof Each of / and f~ x is surjective, so m = n by the previous theorem. 



Corollary If {v\, ..,v m } generates R n , then m>n. 

Proof The hypothesis implies there is a surjective homomorphism R r 
this follows from the first theorem. 



R n . So 



Lemma Suppose M is a f.g. module (i.e., a finite generated R- module). Then 
if M has a basis, that basis is finite. 
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Proof Suppose U C M is a finite generating set and S is a basis. Then any 
element of U is a finite linear combination of elements of S, and thus S is finite. 

Theorem Suppose M is a f.g. module. If M has a basis, that basis is finite 
and any other basis has the same number of elements. This number is denoted by 
dim(M), the dimension of M. (By convention, is a free module of dimension 0.) 

Proof By the previous lemma, any basis for M must be finite. M has a basis of 
n elements iff M ~ R n . The result follows because R n pa R m iff n = m. 

Change of Basis 



Before changing basis, we recall what a basis is. Previously we defined generat- 
ing, independence, and basis for sequences, not for collections. For the concept of 
generating it matters not whether you use sequences or collections, but for indepen- 
dence and basis, you must use sequences. Consider the columns of the real matrix 

/ 2 3 2 \ 
A = , , , 1 . If we consider the column vectors of A as a collection, there are 

only two of them, yet we certainly don't wish to say the columns of A form a basis for 
R 2 . In a set or collection, there is no concept of repetition. In order to make sense, 
we must consider the columns of A as an ordered triple of vectors, and this sequence 
is dependent. In the definition of basis on page 78, basis is defined for sequences, not 
for sets or collections. 

Two sequences cannot begin to be equal unless they have the same index set. 
Here we follow the classical convention that an index set with n elements will be 
{1,2, ..,n}, and thus a basis for M with n elements is a sequence S = {ui,..,u n } 
or if you wish, S = (ui,..,u n ) G M n . Suppose M is an i?-module with a basis of 
n elements. Recall there is a bijection a : Hom^(i? n , M) — ► M n defined by a(h) = 
(h(ei), .., h(e n )). Now h : R n — ► M is an isomorphism iff a(h) is a basis for M. 

Summary The point of all this is that selecting a basis of n elements for M 
is the same as selecting an isomorphism from R n to M, and from this viewpoint, 
change of basis can be displayed by the diagram below. 

Endomorphisms on R n are represented by square matrices, and thus have a de- 
terminant and trace. Now suppose M is a f.g. free module and / : M — ► M is a 
homomorphism. In order to represent / by a matrix, we must select a basis for M 
(i.e., an isomorphism with R n ). We will show that this matrix is well defined up to 
similarity, and thus the determinant, trace, and characteristic polynomial of / are 
well-defined. 
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Definition Suppose M is a free module, S = {ui,..,u n } is a basis for M, and 
/ : M —y M is a homomorphism. The matrix A = (a^j) G R n of / w.r.t. the basis 
S is defined by /(«») = wi«i,i + • • +-u n fln,j- (Note that if M = i?" and Wj = e*, ^4 is 
the usual matrix associated with /). 

Theorem Suppose T = {v\,..,v n } is another basis for M and B G i?„ is the 
matrix of / w.r.t. T. Define C = (cjj) G i? n by v j = U\C\^ + • • +u n c n ^. Then C is 
invertible and B = C~ l AC, i.e., A and i3 are similar. Therefore \A\ = \B\, 
trace(A)=trace(i?), and A and B have the same characteristic polynomial (see page 
66 of chapter 4). 

Conversely, suppose C = (qj) G R n is invertible. Define T = {v\,..,v n } by 
Vi = U\C\ t i + • • +u n c n ^. Then T is a basis for M and the matrix of / w.r.t. T is 
B = C~ l AC. In other words, conjugation of matrices corresponds to change of basis. 

Proof The proof follows by seeing that the following diagram is commutative. 



R r ' 



C 



R n 



B 




M 



f 



M 




A 



R n 





C 



R n 



The diagram also explains what it means for A to be the matrix of / w.r.t. the 
basis S. Let h : R n —>■ M be the isomorphism with h{ti) = Ui for 1 < i < n. Then 
the matrix A G R n is the one determined by the endomorphism h~ l ofoh : R n —y R n . 
In other words, column i of A is h~ l (f(h(ei))). 

An important special case is where M = R n and / : R n —y R n is given by some 
matrix W. Then h is given by the matrix U whose i th column is U{ and A = 
U~ l WU. In other words, W represents / w.r.t. the standard basis, and U~ l WU 
represents/ w.r.t. the basis {ui, ..,u n }. 



Definition Suppose M is a f.g. free module and / : M —y M is a homomorphism. 
Define |/| to be \A\, trace(/) to be trace(A), and CPf(x) to be CPa(x), where A is 
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the matrix of / w.r.t. some basis. By the previous theorem, all three are well-defined, 
i.e., do not depend upon the choice of basis. 



Exercise Let R = Z and / : Z 2 - -> Z 2 be denned by /(£>) 
Find the matrix of / w.r.t. the basis 





Exercise Let L C R 2 be the line L = {(r, 2r)' : r G R}. Show there is one 
and only one homomorphism / : R 2 — ► R 2 which is the identity on L and has 
/((— 1, 1)*) = (1,-1)*. Find the matrix A G R2 which represents / with respect 
to the basis {(1, 2)*, (— 1, 1)'}. Find the determinant, trace, and characteristic 
polynomial of /. Also find the matrix B G R2 which represents / with respect to 
the standard basis. Finally, find an invertible matrix C G R2 with B = C~ l AC. 

Vector Spaces 



So far in this chapter we have been developing the theory of linear algebra in 
general. The previous theorem, for example, holds for any commutative ring R, but 
it must be assumed that the module M is free. Endomorphisms in general will not 
have a determinant, trace, or characteristic polynomial. We now focus on the case 
where R is a field F, and show that in this case, every F-module is free. Thus any 
finitely generated F-module will have a well-defined dimension, and endomorphisms 
on it will have well-defined determinant, trace, and characteristic polynomial. 

In this section, F is a field. F-modules may also be called vector spaces and 
F-module homomorphisms may also be called linear transformations. 

Theorem Suppose M is an F-module and v G M. Then v ^ iff v is independent. 
That is, if v G V and r G F, vr = implies v = in M or r = in F. 



Proof Suppose vr = and r^O. Then 0= (vr)r l = vl 



v. 



Theorem Suppose M ^ is an F-module and v G M. Then v generates M iff v 
is a basis for M. Furthermore, if these conditions hold, then M w Fp, any non-zero 
element of M is a basis, and any two elements of M are dependent. 
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Proof Suppose v generates M. Then v ^ and is thus independent by the 
previous theorem. In this case M pa F, and any non-zero element of F is a basis, and 
any two elements of F are dependent. 

Theorem Suppose M ^ is a finitely generated F-module. If 5 = {i>i,..,f m } 
generates M, then any maximal independent subsequence of S is a basis for M. Thus 
any finite independent sequence can be extended to a basis. In particular, M has a 
finite free basis, and thus is a free F-module. 

Proof Suppose, for notational convenience, that {vi,..,v n } is a maximal inde- 
pendent subsequence of S, and n < i < m. It must be shown that Vi is a linear 
combination of {vi,..,v n }. Since {i>i, .., v n , Vi} is dependent, 3 r-y, ...,r n ,ri not all 

zero, such that V\T\-\ \-v n r n + Viri = 0. Then n ^ and t>j = — (v-\,r\-\ Vv n r n )r~ l . 

Thus {vi, .., v n } generates S and thus all of M. Now suppose T is a finite indepen- 
dent sequence. T may be extended to a finite generating sequence, and inside that 
sequence it may be extended to a maximal independent sequence. Thus T extends 
to a basis. 

After so many routine theorems, it is nice to have one with real power. It not 
only says any finite independent sequence can be extended to a basis, but it can be 
extended to a basis inside any finite generating set containing it. This is one of the 
theorems that makes linear algebra tick. The key hypothesis here is that the ring 
is a field. If R = Z, then Z is a free module over itself, and the element 2 of Z is 
independent. However it certainly cannot be extended to a basis. Also the finiteness 
hypothesis in this theorem is only for convenience, as will be seen momentarily. 



Since F is a commutative ring, any two bases of M must have the same number 
of elements, and thus the dimension of M is well defined (see theorem on page 83). 

Theorem Suppose M is an F-module of dimension n, and {vi, ...,v m } is an 
independent sequence in M. Then m <n and if m = n, {v±, .., v m } is a basis. 

Proof {vi, .., v m } extends to a basis with n elements. 
The next theorem is just a collection of observations. 
Theorem Suppose M and N are finitely generated F-modules. 
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1) M « F n iff dim(M) = n. 

2) M sa iV iff dim(M) = dim(AT). 

3) F m sa F" iff n = m. 

4) dim(M ®iV)= dim(M) + dim(JV). 



Here is the basic theorem for vector spaces in full generality. 

Theorem Suppose M ^ is an F-module and S = {v t }t^T generates M. 

1) Any maximal independent subsequence of S is a basis for M. 

2) Any independent subsequence of S may be extended to a maximal 
independent subsequence of S, and thus to a basis for M. 

3) Any independent subsequence of M can be extended to a basis for M. 
In particular, M has a free basis, and thus is a free F-module. 

Proof The proof of 1) is the same as in the case where S is finite. Part 2) will 
follow from the Hausdorff Maximality Principle. An independent subsequence of S is 
contained in a maximal monotonic tower of independent subsequences. The union of 
these independent subsequences is still independent, and so the result follows. Part 
3) follows from 2) because an independent sequence can always be extended to a 
generating sequence. 

Theorem Suppose M is an F-module and K C M is a submodule. 

1) if is a summand of M, i.e., 3 a submodule L of M with K © L = M . 

2) If M is f.g., then dim(if) < dim(M) and K = M iff dim(K) = dim(M). 

Proof Let T be a basis for K. Extend T to a basis S for M. Then S — T generates 
a submodule L with K © L = M. Part 2) follows from 1). 

Corollary Q is a summand of R. In other words, 3 a Q-submodule V C R 
with Q © V = R as Q-modules. (See exercise on page 77.) 

Proof Q is a field, R is a Q-module, and Q is a submodule of R. 

Corollary Suppose M is a f.g. F-module, iV is an F-module, and / : M — ► A^ 
is a homomorphism. Then dim(M) = dim(ker(/)) + dim(image(/)). 
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Proof Let K = ker(/) and L C M be a submodule with if © L = M. Then 
/ | L : L — ► image (/) is an isomorphism. 

Exercise Suppose i? is a domain with the property that, for i?-modules, every 
submodule is a summand. Show R is a field. 

Exercise Find a free Z-module which has a generating set containing no basis. 

Exercise The real vector space R 2 is generated by the sequence S = 

{(n, 0), (2, 1), (3, 2)}. Show there are three maximal independent subsequences of 
S, and each is a basis for R 2 . (Row vectors are used here just for convenience.) 

The real vector space R 3 is generated by S = {(1, 1, 2), (1, 2, 1), (3, 4, 5), (1, 2, 0)}. 
Show there are three maximal independent subsequences of S and each is a basis 
for R 3 . You may use determinant. 



Square matrices over fields 

This theorem is just a summary of what we have for square matrices over fields. 

Theorem Suppose A G F n and / : F n -»■ F n is defined by f(B) = AB. Let 
Vi,..,v n G F n be the columns of A, and W\,..,w n G F n = F\ jn be the rows of A. 
Then the following are equivalent. 

1) {vi,..,v n } is independent, i.e., / is injective. 

2) {vi, .., v n } is a basis for F n , i.e., / is an automorphism, i.e., A is 
invertible, i.e., | A |^ 0. 

3) {vi, ..,v n } generates F n , i.e., / is surjective. 
1*) {u>i, ..,u> n } is independent. 

2*) {u>i, ..,w n } is a basis for F n , i.e., A* is invertible, i.e., | A 1 |^ 0. 
3*) {wi, ..,«)„} generates F". 
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Proof Except for 1) and 1*), this theorem holds for any commutative ring R. 
(See the section Relating these concepts to square matrices, pages 81 and 82.) 
Parts 1) and 1*) follow from the preceding section. 

Exercise Add to this theorem more equivalent statements in terms of solutions 
of n equations in n unknowns. 

Overview Suppose each of X and Y is a set with n elements and / : X — ► Y is a 
function. By the pigeonhole principle, / is injective iff / is bijective iff / is surjective. 
Now suppose each of U and V is a vector space of dimension n and / : U — ► V is 
a linear transformation. It follows from the work done so far that / is injective iff 
/ is bijective iff / is surjective. This shows some of the simple and definitive nature 
of linear algebra. 

Exercise Let A = (A\, .., A n ) be an nxn matrix over Z with column i = Ai E 
Z n . Let / : Z" -► Z" be defined by f(B) = AB and / : R n -»■ R n be defined by 
f(C) = AC. Show the following are equivalent. (See the exercise on page 79.) 

1) / : Z n -y Z n is injective. 

2) The sequence (Ai, ...A n ) is linearly independent over Z. 

3) \A\^0. 

4) / : R" — *• R" is injective. 

5) The sequence (Ai, ..,A n ) is linearly independent over R. 



Rank of a matrix Suppose A G F m ^ n . The row (column) rank of A is defined 
to be the dimension of the submodule of F n (F m ) generated by the rows (columns) 
of A. 

Theorem If C G F m and D G F n are invertible, then the row (column) rank of 
A is the same as the row (column) rank of CAD. 

Proof Suppose / : F n — ► F m is defined by f(B) = AB. Each column of A 
is a vector in the range F m , and we know from page 81 that each f(B) is a linear 
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combination of those vectors. Thus the image of / is the submodule of F m generated 
by the columns of A, and its dimension is the column rank of A. This dimension 
is the same as the dimension of the image of g o f o h : F n — ► F m , where h is any 
automorphism on F n and g is any automorphism on F m . This proves the theorem 
for column rank. The theorem for row rank follows using transpose. 

Theorem If A G F m ^ ni the row rank and the column rank of A are equal. This 
number is called the rank of A and is < mm{m,n}. 

Proof By the theorem above, elementary row and column operations change 
neither the row rank nor the column rank. By row and column operations, A may be 
changed to a matrix H where ft, 1;1 = •• = h^t = 1 and all other entries are (see the 
first exercise on page 59). Thus row rank = t = column rank. 

Exercise Suppose A has rank t. Show that it is possible to select t rows and t 
columns of A such that the determined t x t matrix is invertible. Show that the rank 
of A is the largest integer t such that this is possible. 

Exercise Suppose A G F m ^ n has rank t. What is the dimension of the solution 
set of AX = 0? 

Definition If N and M are finite dimensional vector spaces and / : iV — ► M is a 
linear transformation, the rank of / is the dimension of the image of /. If / : F n — ► F m 
is given by a matrix A, then the rank of / is the same as the rank of the matrix A. 

Geometric Interpretation of Determinant 



Suppose V C R n is some nice subset. For example, if n = 2, V might be the 
interior of a square or circle. There is a concept of the n-dimensional volume of V . 
For n = 1, it is length. For n = 2, it is area, and for n = 3 it is "ordinary volume". 
Suppose A G R n and / : R" — ► R" is the homomorphism given by A. The volume of 
V does not change under translation, i.e., V and V +p have the same volume. Thus 
f(V) and f{V + p) = f(V) + f(p) have the same volume. In street language, the next 
theorem says that "/ multiplies volume by the absolute value of its determinant" . 

Theorem The n-dimensional volume of f(V) is ±|A|(the n-dimensional volume 
of V). Thus if \A\ — ±1, / preserves volume. 
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Proof If \A\ = 0, image(/) has dimension < n and thus f(V) has n- dimensional 
volume 0. If \A\ ^ then A is the product of elementary matrices (see page 59) 
and for elementary matrices, the theorem is obvious. The result follows because the 
determinant of the composition is the product of the determinants. 

Corollary If P is the n-dimensional parallelepiped determined by the columns 
Vi, .. , v n of A, then the n-dimensional volume of P is ±|A|. 

Proof Let V = [0, 1] x • • x[0, 1] = {dh + • • +e n t n : < t % < 1}. Then 
P = fiY) = {vih + • • +v n t n : < U < 1}. 

Linear functions approximate different iable functions locally 



We continue with the special case F = R. Linear functions arise naturally in 
business, science, and mathematics. However this is not the only reason that linear 
algebra is so useful. It is a central fact that smooth phenomena may be approx- 
imated locally by linear phenomena. Without this great simplification, the world 
of technology as we know it today would not exist. Of course, linear transforma- 
tions send the origin to the origin, so they must be adjusted by a translation. As 
a simple example, suppose h : R — > R is differentiable and p is a real number. Let 
/ : R — ► R be the linear transformation f(x) = h'(p)x. Then h is approximated near 
p by g(x) = h{p) + f(x — p) = h{p) + h'{p)(x — p). 

Now suppose V C R 2 is some nice subset and h = (hi, fi2) : V — > R 2 is injective 

/ dhx dhi \ 

and differentiable. Define the Jacobian by J(h)(x,y) = gfi 2 gfi 2 J and for each 

\ dx dy / 

(x,y) G V, let f(x,y) : R 2 — > R 2 be the homomorphism defined by J(h)(x,y). 
Then for any (pi,P2) £ V, h is approximated near (pi,P2) (after translation) by 

/(p l7 2 ). The area of V is / / ldxdy. From the previous section we know that 

any homomorphism / multiplies area by | / |. The student may now understand 
the following theorem from calculus. (Note that if h is the restriction of a linear 
transformation from R 2 to R 2 , this theorem is immediate from the previous section.) 

Theorem Suppose the determinant of J(h)(x,y) is non- negative for each 

(x,y) G V. Then the area of h(V) is / / | J(h) \ dxdy. 
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The Transpose Principle 

We now return to the case where F is a field (of arbitrary characteristic). F- 
modules may also be called vector spaces and submodules may be called subspaces. 
The study of i?-modules in general is important and complex. However the study of 
F-modules is short and simple - every vector space is free and every subspace is a 
summand. The core of classical linear algebra is not the study of vector spaces, but 
the study of homomorphisms, and in particular, of endomorphisms. One goal is to 
show that if / : V — ► V is a homomorphism with some given property, there exists 
a basis of V so that the matrix representing / displays that property in a prominent 
manner. The next theorem is an illustration of this. 

Theorem Let F be a field and n be a positive integer. 

1) Suppose V is an n-dimensional vector space and / : V — ► V is a 
homomorphism with |/| = 0. Then 3 a basis of V such that the matrix 
representing / has its first row zero. 

2) Suppose A E F n has |A| = 0. Then 3 an invertible matrix C such that 
C~ l AC has its first row zero. 

3) Suppose V is an n-dimensional vector space and / : V — >■ V is a 
homomorphism with |/| = 0. Then 3 a basis of V such that the matrix 
representing / has its first column zero. 

4) Suppose A G F n has |A| = 0. Then 3 an invertible matrix D such that 
D~ X AD has its first column zero. 

We first wish to show that these 4 statements are equivalent. We know that 
1) and 2) are equivalent and also that 3) and 4) are equivalent because change of 
basis corresponds to conjugation of the matrix. Now suppose 2) is true and show 
4) is true. Suppose |^| = 0. Then \A l \ = and by 2) 3 C such that C~ l A l C has 
first row zero. Thus (C~ 1 A t C) t = C^y^C 4 ) -1 has first row column zero. The result 
follows by defining D = (C') _1 . Also 4) implies 2). 

This is an example of the transpose principle. Loosely stated, it is that theorems 
about change of basis correspond to theorems about conjugation of matrices and 
theorems about the rows of a matrix correspond to theorems about the columns of a 
matrix, using transpose. In the remainder of this chapter, this will be used without 
further comment. 
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Proof of the theorem We are free to select any of the 4 parts, and we select 
part 3). Since | / |= 0, / is not injective and 3 a non-zero v\ G V with f(v\) = 0. 
Now v i is independent and extends to a basis {vi, .., v „}. Then the matrix of / w.r.t 
this basis has first column zero. 

Exercise Let A = I ) ■ Find an invertible matrix C G R2 so that C~ l AC 

I \ 

has first row zero. Also let A = 1 3 4 and find an invertible matrix D G R 3 

\2 1 4/ 
so that D X AD has first column zero. 

Exercise Suppose M is an n-dimensional vector space over a field F, k is an 
integer with < k < n, and / : M —y M is an endomorphism of rank k. Show 
there is a basis for M so that the matrix representing / has its first n — k rows zero. 
Also show there is a basis for M so that the matrix representing / has its first n — k 
columns zero. Work these out directly without using the transpose principle. 

Nilpotent Homomorphisms 



In this section it is shown that an endomorphism / is nilpotent iff all of its char- 
acteristic roots are iff it may be represented by a strictly upper triangular matrix. 



Definition An endomorphism / : V — > V is nilpotent if 3 m with f m = 0. Any 
/ represented by a strictly upper triangular matrix is nilpotent (see page 56). 

Theorem Suppose V is an n-dimensional vector space and / : V — ► V is a 
nilpotent homomorphism. Then f n = and 3 a basis of V such that the matrix 
representing / w.r.t. this basis is strictly upper triangular. Thus the characteristic 
polynomial of / is CPf(x) = x n . 

Proof Suppose / 7^ is nilpotent. Let t be the largest positive integer with 
/* ^ 0. Then f\V) C f _1 (V) C •• C f(V) C V. Since / is nilpotent, all of these 
inclusions are proper. Therefore t < n and f n = 0. Construct a basis for V by 
starting with a basis for /*(V), extending it to a basis for f l ~ l (V), etc. Then the 
matrix of / w.r.t. this basis is strictly upper triangular. 

Note To obtain a matrix which is strictly lower triangular, reverse the order of 
the basis. 
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Exercise Use the transpose principle to write 3 other versions of this theorem. 



Theorem Suppose V is an n-dimensional vector space and / : V — > V is a 
homomorphism. Then / is nilpotent iff CP/(x) = x n . (See the exercise at the end 
of Chapter 4 for the case n = 2.) 

Proof Suppose CPf(x) = x n . For n = 1 this implies / = 0, so suppose n > 1. 
Since the constant term of CPf(x) is 0, the determinant of / is 0. Thus 3 a basis 
of V such that the matrix A representing / has its first column zero. Let B e F n -i 
be the matrix obtained from A by removing its first row and first column. Now 



CP A {x) 



x 



xCP B {x). Thus CP B (x) 



X 



n-1 



and by induction on n, B is 



nilpotent and so 3 C such that C X BC is strictly upper triangular. Then 



/ 1 








o\ 



c- 



/o 



\ 



B 



\0 J \0 

is strictly upper triangular. 



/ 1 




Vo 







o\ 



c 



/o 





Vo 



C~ l BC 



Exercise Suppose F is a field, A G F 3 is a strictly lower triangular matrix of 

/ \ 
rank 2, and B= 1 0. Using conjugation by elementary matrices, show there 

\0 1 OJ 

is an invertible matrix C so that C l AC = B. Now suppose V is a 3-dimensional 
vector space and / : V — ► V is a nilpotent endomorphism of rank 2. We know / can 
be represented by a strictly lower triangular matrix. Show there is a basis {v\, v 2, v 3} 
for V so that i3 is the matrix representing /. Also show that /(i>i) = i>2, /(^2) = ^3, 
and /(V3) = 0. In other words, there is a basis for V of the form {w , f(v), f 2 (v )} 
with f(v) =0. 



Exercise Suppose V is a 3-dimensional vector space and / : V — ► V is a nilpotent 
endomorphism of rank 1. Show there is a basis for V so that the matrix representing 


/ is j 1 
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Eigenvalues 

Our standing hypothesis is that V is an n- dimensional vector space over a field F 
and / : V — > V is a homomorphism. 

Definition An element A G F is an eigenvalue of / if 3 a non-zero d£ V with 
/(i>) = Ai>. Any such v is called an eigenvector. E\ G V is defined to be the set of 
all eigenvectors for A (plus 0). Note that E\ = ker(A7 — /) is a subspace of V . The 
next theorem shows the eigenvalues of / are just the characteristic roots of /. 

Theorem If A £ F then the following are equivalent. 

1) A is an eigenvalue of /, i.e., (XI — f) : V — ► V is not injective. 

2) | (XI -f) |=0. 

3) A is a characteristic root of /, i.e., a root of the characteristic 
polynomial CPf(x) = \ (xl — A) |, where A is any matrix representing /. 

Proof It is immediate that 1) and 2) are equivalent, so let's show 2) and 3) 
are equivalent. The evaluation map F[x] — ► F which sends h(x) to h(X) is a ring 
homomorphism (see theorem on page 47). So evaluating (xl — A) at x = X and 
taking determinant gives the same result as taking the determinant of (xl — A) and 
evaluating at x = X. Thus 2) and 3) are equivalent. 

The nicest thing you can say about a matrix is that it is similar to a diagonal 
matrix. Here is one case where that happens. 

Theorem Suppose A l7 .., X k are distinct eigenvalues of /, and Vi is an eigenvector 
of X{ for 1 < i < k. Then the following hold. 

1) {vi,..,v k } is independent. 

2) If k = n, i.e., if CPf(x) = (x — X\) ■ ■ ■ (x — A n ), then {vi, .., v n } is a 
basis for V. The matrix of / w.r.t. this basis is the diagonal matrix whose 
(i,i) term is Aj. 

Proof Suppose {i>i, .., i^} is dependent. Suppose t is the smallest positive integer 
such that {vi,..,Vt} is dependent, and V\Ti + • • +v t r t = is a non-trivial linear 
combination. Note that at least two of the coefficients must be non-zero. Now 
(/ - A t )(t>iri H \-v t r t ) = i>i(Ai - \ t )ri H \-v t -i(X t -i - X t )r t -i + = is a shorter 
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non-trivial linear combination. This is a contradiction and proves 1). Part 2) follows 
from 1) because dim(V) = n. 

Exercise Let A = I J £ R2. Find an invertible C E C2 such that 

C~ l AC is diagonal. Show that C cannot be selected in R 2 . Find the characteristic 
polynomial of A. 

Exercise Suppose V is a 3-dimensional vector space and / : V — ► V is an endo- 
morphism with CPf(x) = (x — A) 3 . Show that (/ — XI) has characteristic polynomial 
x 3 and is thus a nilpotent endomorphism. Show there is a basis for V so that the 

A0 0\/A0 0\ / A 

matrix representing / is jlA0,lA0|or|0A0 

1 A / \ A J \ A 

We could continue and finally give an ad hoc proof of the Jordan canonical form, 
but in this chapter we prefer to press on to inner product spaces. The Jordan form 
will be developed in Chapter 6 as part of the general theory of finitely generated 
modules over Euclidean domains. The next section is included only as a convenient 
reference. 

Jordan Canonical Form 



This section should be just skimmed or omitted entirely. It is unnecessary for the 
rest of this chapter, and is not properly part of the flow of the chapter. The basic 
facts of Jordan form are summarized here simply for reference. 

The statement that a square matrix B over a field F is a Jordan block means that 
3 A G F such that B is a lower triangular matrix of the form 



B 



/A \ 

1 A 



{0 1 \j 



B gives a homomorphism g : F m — ► F m with g(e m ) = Xe T 



and g{ti) = ej + i + Ae^ for 1 < i < m. Note that CPb(x) = (x — A) m and so A is the 
only eigenvalue of B, and B satisfies its characteristic polynomial, i.e., CPb(B) = 0. 
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Definition A matrix D G F n is in Jordan form if 3 Jordan blocks B 1 , .. , B t such 



Mi 



that D 



Bo 



V 



\ 



Suppose D is of this form and Bi G F rii has 



Bt 



eigenvalue Aj. Then n\ + ■ ■ +n t = n and CPd(x) = (x — Ai)™ 1 • -(x — A t ) n *. Note that 
a diagonal matrix is a special case of Jordan form. D is a diagonal matrix iff each 
rii = 1, i.e., iff each Jordan block is a 1 x 1 matrix. 

Theorem If A G F n , the following are equivalent. 

1) 3 an invertible C G F n such that C~ l AC is in Jordan form. 

2) 3 Ai, .., A n G .F (not necessarily distinct) such that CPa(x) = (x — Ai) • • 
(x — A n ). (In this case we say that all the eigenvalues of A belong to F.) 

Theorem Jordan form (when it exists) is unique. This means that if A and D are 
similar matrices in Jordan form, they have the same Jordan blocks, except possibly 
in different order. 

The reader should use the transpose principle to write three other versions of the 
first theorem. Also note that we know one special case of this theorem, namely that 
if A has n distinct eigenvalues in F, then A is similar to a diagonal matrix. Later on 
it will be shown that if A is a symmetric real matrix, then A is similar to a diagonal 
matrix. 

Let's look at the classical case A G R n . The complex numbers are algebraically 
closed. This means that CPa{x) will factor completely in C[x], and thus 3 C G C n 
with C~ l AC in Jordan form. C may be selected to be in R n iff all the eigenvalues 
of A are real. 

Exercise Find all real matrices in Jordan form that have the following charac- 
teristic polynomials: x(x — 2), (x — 2) 2 , (x — 2)(x — 3)(x — 4), (x — 2)(x — 3) 2 , 
(x - 2)\x - 3) 2 , (x-2)(x-3) 3 . 



Exercise Suppose D G F n is in Jordan form and has characteristic polynomial 
a + a ± x -\ \-x n . Show a I + a x D -\ \-D n = 0, i.e., show CP D (D) = 0. 
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Exercise (Cayley-Hamilton Theorem) Suppose E is a field and A G E n . 
Assume the theorem that there is a field F containing E such that CPa{x) factors 
completely in F[x]. Thus 3 an invertible C G F n such that D = C~ x AC is in Jordan 
form. Use this to show CPa{A) = 0. (See the second exercise on page 66.) 

Exercise Suppose A G F n is in Jordan form. Show A is nilpotent iff A n = 
iff CPa(x) = x n . (Note how easy this is in Jordan form.) 

Inner Product Spaces 



The two most important fields for mathematics and science in general are the 
real numbers and the complex numbers. Finitely generated vector spaces over R or 
C support inner products and are thus geometric as well as algebraic objects. The 
theories for the real and complex cases are quite similar, and both could have been 
treated here. However, for simplicity, attention is restricted to the case F = R. 
In the remainder of this chapter, the power and elegance of linear algebra become 
transparent for all to see. 

Definition Suppose V is a real vector space. An inner product (or dot product) 
on V is a function V x V — ► R which sends (u, v ) to u ■ v and satisfies 

1) {u\r\ + u 2 r 2 ) ■ v = (wi • v)r\ + (u 2 ■ v)r 2 for all u\, u 2 , v G V 
v ■ (wiri + u 2 r 2 ) = (v ■ U\)r\ + (v ■ u 2 )r 2 and r 1? r 2 G R. 

2) u ■ v = v ■ u for all u, v G V. 

3) u ■ u > and u ■ u = iff u = for all u G V . 

Theorem Suppose V has an inner product. 

1) If v G V, f : V —y R defined by f(u) = u ■ v is a homomorphism. 
Thus • v = 0. 

2) Schwarz' inequality. If w, v G V", (w • v) 2 < (u ■ u)(v ■ v ). 



Proof of 2) Let a = y/v ■ v and b = \fu ■ u. If a or b is 0, the result is obvious. 
Suppose neither a nor b is 0. Now < (ua ± vb) ■ (ua ± i>6) = (u ■ u)a 2 ± 2ab(u ■ v)+ 
(v-v)b 2 = b 2 a 2 ±2ab(u-v) + a 2 b 2 . Dividing by 2ab yields < ab± (u-v) or | u-v |< ab. 
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Theorem Suppose V has an inner product. Define the norm or length of a vector 
v by ||i>|| = y/v ■ v. The following properties hold. 

1) ||v|| =0 iff v = 0. 

2) \\vr\\ = \\v\\ | r |. 

3) | u ■ v | < ||-u||||f ||. (Schwarz' inequality) 

4) ||-u + u||<||-u|| + ||w||. (The triangle inequality) 



Proof of 4) ||w + v || = (u + v) • (u + v) = \\u\\ + 2(u ■ v) + ||w|| 2 < 



a 



2 \\u \\\\v\\ + \\v\\ = (\\u\\ + \\v 



2 _ /IL.II i IL,II\2 



Definition An Inner Product Space (IPS) is a real vector space with an 

inner product. Suppose V is an IPS. A sequence {vi,..,v n } is orthogonal provided 
Vi ■ v j = when i ^ j. The sequence is orthonormal if it is orthogonal and each 
vector has length 1, i.e., Vi ■ Vj = 8^j for 1 < i, j < n. 

Theorem If S = {v\, ..,v n } is an orthogonal sequence of non-zero vectors in an 

(Vi v n \ 

71 — m"5 ' ' ' > 71 — m" r i s orthonormal. 
Fill F„||J 

Proof Suppose v-yr-y + • • +v n r n = 0. Then = (vyr-y + • • +v n r n ) ■ v t = r^fj • Vi) 
and thus r^ = 0. Thus S is independent. The second statement is transparent. 

It is easy to define an inner product, as is shown by the following theorem. 

Theorem Suppose V is a real vector space with a basis S = {vy,..,v n }. Then 
there is a unique inner product on V which makes S an orthornormal basis. It is 
given by the formula {vyry + • • +v n r n ) ■ (vySy + • • +v n s n ) = rySy + • • +r n s n . 

Convention R™ will be assumed to have the standard inner product defined by 
(ry, .., r n y ■ (sy, .., s n Y = rySy + • • +r n s n . S = {ey, .., e n } will be called the canonical 
or standard orthonormal basis (see page 72). The next theorem shows that this 
inner product has an amazing geometry. 

Theorem If u,v G R", u • v = \\u\\\\v\\ cos where is the angle between u 
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and v. 

Proof Let u = (ry, ..,r n ) and v = (sy, .., s n ). By the law of cosines ||w — v\\ 2 = 
\\u\\ 2 + ||u|| 2 — 2||u||||v|| cos 0. So {r 1 — Sy) 2 + • • +(r n — s n ) 2 = r\ + • • +r 2 + s\ + • • 
+s 2 — 2||-u||||i>|| cos 0. Thus riSi + • • +r n s n = ||tt||||f || cos 0. 

Exercise This is a simple exercise to observe that hyperplanes in R n are cosets. 
Suppose / : R n — ► R is a non-zero homomorphism given by a matrix A = (ai, .., a n ) G 
Ri n . Then L = ker(/) is the set of all solutions to a^X\ + • • +a n x n = 0, i.e., the 

/ Cl \ 



set of all vectors perpendicular to A. Now suppose b G R and C 



G R r 



has f(C) = b. Then f~ Y (b) is the set of all solutions to a\X\ + • • +a n x n = b which 
is the coset L + C, and this the set of all solutions to oi(xi — c\)-\ \-a n (x n — c n ) = 0. 



Gram-Schmidt orthonormalization 

Theorem (Fourier series) Suppose W is an IPS with an orthonormal basis 
{wi,..,w n }. Then if v G W, v = Wi(v ■ Wi) + ■ • +w n (v ■ w n ). 

Proof v = w 1 r 1 + • • +w n r n and v ■ Wi = {w-yr-y + • • +w n r n ) ■w i = r i 

Theorem Suppose W is an IPS, Y C IT is a subspace with an orthonormal basis 

{wy, ..,Wk}, and v G W — Y. Define the projection of v onto T by p(v) = Wy(v-Wy) + -- 

+Wk(v-Wk), and let w = v—p(v). Then (w-Wi) = (v — wy(v-wy) — Wk(v-Wk))-Wi = 0. 

in 
Thus if Wk+y = M — n"5 then {wi, .., Wfe+ij is an orthonormal basis for the subspace 

\\w\\ 

generated by {wy, ..,Wk,v}. If {wy, ..,u>k,v} is already orthonormal, Wk+y = v. 

Theorem (Gram-Schmidt) Suppose W is an IPS with a basis {vy,..,v n }. 
Then W has an orthonormal basis {wy, ..,w n }. Moreover, any orthonormal sequence 
in W extends to an orthonormal basis of W. 

Proof Let Wy = - — -. Suppose inductively that {wy, ..,Wk} is an orthonormal 

Fill 
basis for Y, the subspace generated by {vy,..,Vk}. Let w = Vk+y — p{vk+y) and 
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w 
Wk+i = 7i — [?• Then by the previous theorem, {wi, ..,Wk+i} is an orthonormal basis 

IMI 

for the subspace generated by {u>i, ..,Wk,Vk+i}- In this manner an orthonormal basis 
for W is constructed. Notice that this construction defines a function h which sends 
a basis for W to an orthonormal basis for W (see topology exercise on page 103). 

Now suppose W has dimension n and {u>i, ..,Wk} is an orthonormal sequence in 
W. Since this sequence is independent, it extends to a basis {wi, ..,Wk, ffc+i, ■•, v n }. 
The process above may be used to modify this to an orthonormal basis {uq, ..,w n }. 



Exercise Let / : R 3 — > R be the homomorphism defined by the matrix (2,1,3). 
Find an orthonormal basis for the kernel of /. Find the projection of (ei + e 2 ) onto 
ker(/). Find the angle between e\ + e 2 and the plane ker(/). 

Exercise Let W = R 3 have the standard inner product and Y C W be the 

subspace generated by {wi,w 2 } where u>\ = (1,0,0)* and w 2 = (0,1,0)*. W is 

generated by the sequence {wi,w 2 ,v} where v = (1,2,3)*. As in the first theorem 

of this section, let w = v — p(v), where p(v) is the projection of v onto Y, and set 

w 
w 3 = 71 — 17- Find W3 and show that for any t with < t < 1, {wi, W2, (1 — t)v + tw^} 

IMI 

is a basis for W. This is a key observation for an exercise on page 103 showing 0(n) 
is a deformation retract of GL n (R). 



Isometries Suppose each of U and V is an IPS. A homomorphism / : U — ► V 
is said to be an isometry provided it is an isomorphism and for any Ui,u 2 in U, 
(«i • 1*2)1/ = (/(«i) • f(u 2 ))v- 

Theorem Suppose each of U and V is an n-dimensional IPS, {ui,..,u n } is an 
orthonormal basis for U, and / : U —>■ V is a homomorphism. Then / is an isometry 
iff {/(wi), ..,/(n„)} is an orthonormal sequence in V. 

Proof Isometries certainly preserve orthonormal sequences. So suppose T = 

{/(tti), ..,f(u n )} is an orthonormal sequence in V. Then T is independent and thus 
T is a basis for V and thus / is an isomorphism (see the second theorem on page 79). 
It is easy to check that / preserves inner products. 

We now come to one of the definitive theorems in linear algebra. It is that, up to 
isometry, there is only one inner product space for each dimension. 
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Theorem Suppose each of U and V is an n- dimensional IPS. Then 3 an isometry 
/ : U — ► V. In particular, U is isometric to R n with its standard inner product. 

Proof There exist orthonormal bases {ui,..,u n } for U and {vi,..,v n } for V. 
By the first theorem on page 79, there exists a homomorphism / : U — ► V with 
/(ttj) = fj, and by the previous theorem, / is an isometry. 

Exercise Let / : R 3 —>■ R be the homomorphism defined by the matrix (2,1,3). 
Find a linear transformation h : R 2 — ► R 3 which gives an isometry from R 2 to ker(/). 



Orthogonal Matrices 



As noted earlier, linear algebra is not so much the study of vector spaces as it is 
the study of endomorphisms. We now wish to study isometries from R n to R n . 

We know from a theorem on page 90 that an endomorphism preserves volume iff 
its determinant is ±1. Isometries preserve inner product, and thus preserve angle and 
distance, and so certainly preserve volume. 

Theorem Suppose A G R n and / : R n — ► R ra is the homomorphism defined by 
f(B) = AB. Then the following are equivalent. 

1) The columns of A form an orthonormal basis for R n , i.e., A 1 A = I . 

2) The rows of A form an orthonormal basis for R n , i.e., AA l = I. 

3) / is an isometry. 

Proof A left inverse of a matrix is also a right inverse (see the exercise on 
page 64). Thus 1) and 2) are equivalent because each of them says A is invert- 
ible with A~ x = A 1 . Now {ei, ..,e n } is the canonical orthonormal basis for R n , and 
f(ei) is column i of A. Thus by the previous section, 1) and 3) are equivalent. 

Definition If A 6 R n satisfies these three conditions, A is said to be orthogonal. 
The set of all such A is denoted by 0(n), and is called the orthogonal group. 

Theorem 

1) If A is orthogonal, | A |= ±1. 

2) If A is orthogonal, A~ l is orthogonal. If A and C are orthogonal, AC is 
orthogonal. Thus 0{n) is a multiplicative subgroup of GL n (H). 
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3) Suppose A is orthogonal and / is denned by f(B) = AB. Then / preserves 
distances and angles. This means that if u,v G R n , ||w — v\\ = 
\\f(u) — f(v)\\ and the angle between u and v is equal to the angle between 
/(«) and f(v). 

Proof Part 1) follows from |yl| 2 = \A\ \A t \ = \I\ = 1. Part 2) is imme- 
diate, because isometries clearly form a subgroup of the multiplicative group of 
all automorphisms. For part 3) assume / : R ra — ► R ra is an isometry. Then 

|| U _ V ||2 = („ - V ) . („ - V ) = /(„ - V ) ■ f(u -V) = \\f(u - V)\\ 2 = \\f(u) - f(v)\\ 2 . 

The proof that / preserves angles follows from u ■ v = ||«||||i>||cos©. 

Exercise Show that if A G 0(2) has \A\ = 1, then A = \ C ° S " ~ Sm I ) for 

y sinB cos0 J 

some number 0. (See the exercise on page 56.) 

Exercise (topology) Let R„ ~ R n have its usual metric topology. This means 
a sequence of matrices {Ai} converges to A iff it converges coordinatewise. Show 
GL n (R) is an open subset and 0(n) is closed and compact. Let h : GL n (R.) — ► 
0(n) be defined by Gram-Schmidt. Show H : GL n (R) x [0,1] -► GL n (R) defined 
by H(A,t) = (1 — t)A + th(A) is a deformation retract of GL n (R) to 0(n). 

Diagonalization of Symmetric Matrices 



We continue with the case F = R. Our goals are to prove that, if A is a symmetric 
matrix, all of its eigenvalues are real and that 3 an orthogonal matrix C such that 
C~ l AC is diagonal. As background, we first note that symmetric is the same as 
self- adjoint. 

Theorem Suppose A e R n and u,v G R". Then (A t u) ■ v = u ■ (Av). 

Proof If y, z G R n , then the dot product y ■ z, is the matrix product y t z, and 
matrix multiplication is associative. Thus (A 4 -u) • v = (u l A)v = -u*(Ai>) = u ■ (Av). 

Definition Suppose A G R n . A is said to be symmetric provided A 1 = A. Note 
that any diagonal matrix is symmetric. A is said to be self-adjoint if (Au)-v = u-(Av) 
for all u,v G R ra . The next theorem is just an exercise using the previous theorem. 

Theorem A is symmetric iff A is self-adjoint. 
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Theorem Suppose A G R ra is symmetric. Then 3 real numbers Ai,..,A n (not 
necessarily distinct) such that CPa(x) = (x — X\)(x — X2) • • • (x — X n ). That is, all 
the eigenvalues of A are real. 

Proof We know CPa(x) factors into linear s over C. If \x = o + bi is a complex 
number, its conjugate is defined by jl = a — bi. If h : C — ► C is defined by fo(/i) = /Z, 
then h is a ring isomorphism which is the identity on R. If w = (ciij) is a complex 
matrix or vector, its conjugate is defined by w = (a^j). Since A G R ra is a real 
symmetric matrix, A = A 1 = A 1 . Now suppose A is a complex eigenvalue of A 
and i> G C n is an eigenvector with Av = Aw. Then A(i>*-y) = (Xv) l v = (Av) t; v = 
(v f A)v = v\Av) = v t (Av) = w'(Aw) = X(v f v). Thus A = A and A G R. Or 
you can define a complex inner product on C n by (w ■ v) = w t: v. The proof then 
reads as \{v ■ v) = (Xv ■ v) = {Av ■ v) = (v ■ Av) = (v ■ \v) = X(v ■ v). Either way, 
A is a real number. 

We know that eigenvectors belonging to distinct eigenvalues are linearly indepen- 
dent. For symmetric matrices, we show more, namely that they are perpendicular. 

Theorem Suppose A is symmetric, Ai, A2 G R are distinct eigenvalues of A, and 
Au = X\U and Av = X2V. Then u ■ v — 0. 

Proof Xi(u ■ v) = (Au) ■ v = u ■ (Av) = X2(u ■ v) . 



Review Suppose A G R n and / : R n -> R n is defined by f(B) = AB. Then A 
represents / w.r.t. the canonical orthonormal basis. Let S = {vi,..,v n } be another 
basis and C G R n be the matrix with Vi as column i. Then C~ 1 AC is the matrix 
representing/ w.r.t. S. Now S is an orthonormal basis iff C is an orthogonal matrix. 

Summary Representing / w.r.t. an orthonormal basis is the same as conjugating 
A by an orthogonal matrix. 

Theorem Suppose A G R n and C G 0(n). Then A is symmetric iff C~ 1 AC 
is symmetric. 

Proof Suppose A is symmetric. Then (C^ACf = C^C" 1 )* = C~ X AC. 

The next theorem has geometric and physical implications, but for us, just the 
incredibility of it all will suffice. 
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Theorem If A 6 R n , the following are equivalent. 

1) A is symmetric. 

2) 3 C e 0(n) such that C~ l AC is diagonal. 



Proof By the previous theorem, 2) ^> 1). Show 1) =4> 2). Suppose A is a 
symmetric 2x2 matrix. Let A be an eigenvalue for A and {vi, v 2} be an orthonormal 
basis for R 2 with Av\ = \v\. Then w.r.t this basis, the transformation determined 

by A is represented by J . Since this matrix is symmetric, b = 0. 



Now suppose by induction that the theorem is true for symmetric matrices in 
R t for t < n, and suppose A is a symmetric n x n matrix. Denote by Ai, .., A& the 
distinct eigenvalues of A, k < n. If k = n, the proof is immediate, because then there 
is a basis of eigenvectors of length 1, and they must form an orthonormal basis. So 
suppose k < n. Let v\,..,Vk be eigenvectors for Ai,..,Afc with each || Vi ||= 1. They 
may be extended to an orthonormal basis v±, ..,v n . With respect to this basis, the 



/ / Ai 



transformation determined by A is represented by 






A fe / 



V 



(o) 



(D) J 



Since this is a symmetric matrix, B = and D is a symmetric matrix of smaller 
size. By induction, 3 an orthogonal C such that C~ X DC is diagonal. Thus conjugating 
' / 



by 



C 



makes the entire matrix diagonal. 



This theorem is so basic we state it again in different terminology. If V is an IPS, a 
linear transformation / : V — ► V is said to be self-adjoint provided (u-f(v)) = (f(u)-v) 
for all u,v &V . 



Theorem If V is an n-dimensional IPS and / : V — ► V is a linear transformation, 
then the following are equivalent. 

1) / is self-adjoint. 

2) 3 an orthonormal basis {v±, ..., v n } for V with each 
Vi an eigenvector of /. 
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Exercise Let A 



Do the same for A 




. Find an orthogonal C such that C l AC is diagonal. 



Exercise Suppose A, D <E R n are symmetric. Under what conditions are A and D 
similar? Show that, if A and D are similar, 3 an orthogonal C such that D = C~ l AC. 

Exercise Suppose V is an n-dimensional real vector space. We know that V is 
isomorphic to R n . Suppose / and g are isomorphisms from V to R n and A is a subset 
of V. Show that f(A) is an open subset of R n iff g(A) is an open subset of R". This 
shows that V , an algebraic object, has a god-given topology. Of course, if V has 
an inner product, it automatically has a metric, and this metric will determine that 
same topology. Finally, suppose V and W are finite- dimensional real vector spaces 
and h : V —>■ W is a linear transformation. Show that h is continuous. 



Exercise Define E : C n -»■ C n by £(A) = e A = / + A + {l/2\)A 2 + ■■. This series 
converges and thus E is a well defined function. If AB = BA, then E(A + B) = 
E(A)E(B). Since A and -A commute, / = E(0) = E(A - A) = E(A)E(-A), and 
thus E(A) is invertible with E(A)' 1 = E(-A). Furthermore E(A t ) = E(A)\ and 
if C is invertible, E(C~ l AC) = C~ 1 E(A)C. Now use the results of this section to 
prove the statements below. (For part 1, assume the Jordan form, i.e., assume any 
A G C n is similar to a lower triangular matrix.) 

1) If A e C n , then | e A |= e trace ( A ). Thus if A e R n , | e A |= 1 
iff trace(A) = 0. 

2) 3 a non-zero matrix N G R2 with e N = I. 

3) If N G R n is symmetric, then e N = I iff N = 0. 

4) If A G R n and A* = -A, then e A G 0(n). 
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Appendix 



The five previous chapters were designed for a year undergraduate course in algebra. 
In this appendix, enough material is added to form a basic first year graduate course. 
Two of the main goals are to characterize finitely generated abelian groups and to 
prove the Jordan canonical form. The style is the same as before, i.e., everything is 
right down to the nub. The organization is mostly a linearly ordered sequence except 
for the last two sections on determinants and dual spaces. These are independent 
sections added on at the end. 

Suppose R is a commutative ring. An i?-module M is said to be cyclic if it can 
be generated by one element, i.e., M « R/I where I is an ideal of R. The basic 
theorem of this chapter is that if R is a Euclidean domain and M is a finitely generated 
i?-module, then M is the sum of cyclic modules. Thus if M is torsion free, it is a 
free i?-module. Since Z is a Euclidean domain, finitely generated abelian groups 
are the sums of cyclic groups - one of the jewels of abstract algebra. 

Now suppose F is a field and V is a finitely generated F- module. If T : V — ► V is 
a linear transformation, then V becomes an F[a;]-module by defining vx = T(v). Now 
F[x] is a Euclidean domain and so Vp[x] is the sum of cyclic modules. This classical 
and very powerful technique allows an easy proof of the canonical forms. There is a 
basis for V so that the matrix representing T is in Rational canonical form. If the 
characteristic polynomial of T factors into the product of linear polynomials, then 
there is a basis for V so that the matrix representing T is in Jordan canonical form. 
This always holds if F = C. A matrix in Jordan form is a lower triangular matrix 
with the eigenvalues of T displayed on the diagonal, so this is a powerful concept. 

In the chapter on matrices, it is stated without proof that the determinant of the 
product is the product of the determinants. A proof of this, which depends upon the 
classification of certain types of alternating multilinear forms, is given in this chapter. 
The final section gives the fundamentals of dual spaces. 
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The Chinese Remainder Theorem 

On page 50 in the chapter on rings, the Chinese Remainder Theorem was proved 
for the ring of integers. In this section this classical topic is presented in full generality. 
Surprisingly, the theorem holds even for non- commutative rings. 

Definition Suppose R is a ring and A 1 ,A 2 ,..., A m are ideals of R. Then the sum 
A\ + A 2 + ■ ■ ■ + A m is the set of all a± + a 2 + ■ ■ ■ + a m with Oj G A*. The product 
A\A 2 ■ ■ ■ A rn is the set of all finite sums of elements a\a 2 ■ ■ ■ a m with a» G Aj. Note 
that the sum and product of ideals are ideals and A\A 2 ■ ■ ■ A m C (A\ n A 2 n • • • n A m ). 

Definition Ideals A and B of R are said to be comaximal if A + B = R. 

Theorem If A and B are ideals of a ring R, then the following are equivalent. 

1) A and B are comaximal. 

2) 3 a e A and be B with a + b = 1. 

3) ft (A) = R/B where -k : R — > i?/S is the projection. 

Theorem If Ai,A 2 , ...,A m and 5 are ideals of i? with Aj and -B comaximal for 
each i, then ^4i^2 ■ ■ ■ A m and i3 are comaximal. Thus AiC\ A 2 C\ ■ ■ ■ C\ A m and B 
are comaximal. 

Proof Consider tt : i? -»■ i?/B. Then 7r(^iA 2 ■ ■ ■ A m ) = 7r(Ai)7r(A 2 ) ■ ■ ■ ft(A m ) = 
(R/B)(R/B) ■ ■ ■ (R/B) = R/B. 

Chinese Remainder Theorem Suppose Ai,A 2 ,...,A n are pairwise comaximal 
ideals of R, with each A t ^ R. Then the natural map 7r : R — ► R/Ai x R/A 2 x • • • x 
R/A n is a surjective ring homomorphism with kernel Ai D A 2 D • • • D A n . 

Proof There exists a» G Ai and 6j G Ai A 2 ■ ■ ■ Ai_iAj + i • • • A n with cti + bi = 1 . Note 
that ft(bi) = (0, .., 0, lj, 0, ..,0). If (ri + Ai, r 2 + A 2 , ..., r n + A n ) is an element of the 
range, it is the image of ri&i+r 2 6 2 H Yr n b n = ri(l-ai) + r 2 (l-a 2 )H \-r n (l-a n ). 

Theorem If R is commutative and A 1 ,A 2 ,...,A n are pairwise comaximal ideals 
of R, then AiA 2 • • • A n = A x n A 2 n • • • n A n . 

Proof for n = 2. Show AifiA 2 c A\A 2 . 3 ai G Ai and a 2 G A 2 with ai + o 2 = 1. 
If c G Ai n A 2 , then c = c(oi + o 2 ) G AiA 2 . 
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Prime and Maximal Ideals and UFD S 

In the first chapter on background material, it was shown that Z is a unique 
factorization domain. Here it will be shown that this property holds for any principle 
ideal domain. Later on it will be shown that every Euclidean domain is a principle 
ideal domain. Thus every Euclidean domain is a unique factorization domain. 

Definition Suppose R is a commutative ring and I C R is an ideal. 

I is prime means I ^ R and if a, b G R have ab G /, then a or b G /. 

/ is maximal means I ^ R and there are no ideals properly between / and R. 

Theorem is a prime ideal of R iff R is 



is a maximal ideal of R iff R is 



Theorem Suppose J C R is an ideal, J ^ R. 
J is a prime ideal iff R/J is 



J is a maximal ideal iff R/J is 

Corollary Maximal ideals are prime. 
Proof Every field is a domain. 

Theorem If a G R is not a unit, then 3 a maximal ideal I of R with a G /. 

Proof This is a classical application of the Hausdorff Maximality Principle. Con- 
sider {J : J is an ideal of R containing a with J ^ R}. This collection contains a 

maximal monotonic collection {Vt}teT- The ideal V = I) Vt does not contain 1 and 

teT 
thus is not equal to R. Therefore V is equal to some Vt and is a maximal ideal 

containing o. 

Note To properly appreciate this proof, the student should work the exercise in 
group theory at the end of this section (see page 114). 



Definition Suppose R is a domain and a,b G R. Then we say a ~ b iff there 
exists a unit u with au = b. Note that ~ is an equivalence relation. If a ~ b, then a 
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and b are said to be associates. 

Examples If R is a domain, the associates of 1 are the units of R, while the only 
associate of is itself. If n G Z is not zero, then its associates are n and —n. 
If F is a field and g E F[x] is & non-zero polynomial, then the associates of g are 
all eg where c is a non-zero constant. 

The following theorem is elementary, but it shows how associates fit into the 
scheme of things. An element a divides b (a\b) if 3! c G R with ac = b. 

Theorem Suppose R is a domain and a,b G (R — 0). Then the following are 
equivalent. 

1) a ~ b. 

2) a\b and b\a. 

3) a# = bR. 

Parts 1) and 3) above show there is a bijection from the associate classes of R to 
the principal ideals of R. Thus if R is a PID, there is a bijection from the associate 
classes of R to the ideals of R. If an element of a domain generates a non-zero prime 
ideal, it is called a prime element. 

Definition Suppose R is a domain and a G R is a non-zero non-unit. 

1) a is irreducible if it does not factor, i.e., a = be =£■ 6 or c is a unit. 

2) a is prime if it generates a prime ideal, i.e., a\bc =>■ a|6 or a|c. 

Note If a is a prime and a\c\C2 ■ ■ ■ c n , then o|cj for some i. This follows from the 
definition and induction on n. If each Cj is irreducible, then a ~ q for some i. 

Note If a ~ o, then a is irreducible (prime) iff 6 is irreducible (prime). In other 
words, if a is irreducible (prime) and u is a unit, then aw is irreducible (prime). 

Note a is prime ^> a is irreducible. This is immediate from the definitions. 

Theorem Factorization into primes is unique up to order and associates, i.e., if 
d = b\b 2 ■ ■ ■ b n = C\C2 ■ ■ ■ c m with each 6j and each q prime, then n = m and for some 
permutation a of the indices, bi and c CT (j) are associates for every i. Note also 3 a unit 
u and primes Pi,p2, ■ ■ ■ ,Pt where no two are associates and du = p S ip s 2 2 ■ ■ -pT ■ 
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Proof This follows from the notes above. 

Definition R is a factorization domain (FD) means that R is a domain and if a is 
a non-zero non-unit element of R, then a factors into a finite product of irreducibles. 

Definition R is a unique factorization domain (UFD) means R is a FD in which 
factorization is unique (up to order and associates). 

Theorem If R is a UFD and a is a non-zero non-unit of R, then a is irreducible 
<^> a is prime. Thus in a UFD, elements factor as the product of primes. 

Proof Suppose R is a UFD, a is an irreducible element of R, and a\bc. If either 
6 or c is a unit or is zero, then a divides one of them, so suppose each of b and c is 
a non-zero non-unit element of R. There exists an element d with ad = be. Each of 
b and c factors as the product of irreducibles and the product of these products is 
the factorization of be. It follows from the uniqueness of the factorization of ad = be, 
that one of these irreducibles is an associate of a, and thus a\b or a\c. Therefore 
the element a is a prime. 

Theorem Suppose R is a FD. Then the following are equivalent. 

1) R is a UFD. 

2) Every irreducible element of R is prime, i.e., a irreducible -£4> a is prime. 

Proof We already know 1) =£• 2). Part 2) => 1) because factorization into primes 
is always unique. 

This is a revealing and useful theorem. If R is a FD, then R is a UFD iff each 
irreducible element generates a prime ideal. Fortunately, principal ideal domains 
have this property, as seen in the next theorem. 

Theorem Suppose R is a PID and a G R is non-zero non-unit. Then the following 
are equivalent. 

1) aR is a maximal ideal. 

2) aR is a prime ideal, i.e., a is a prime element. 

3) a is irreducible. 

Proof Every maximal ideal is a prime ideal, so 1) =4> 2). Every prime element is 
an irreducible element, so 2) =>■ 3). Now suppose a is irreducible and show aR is a 
maximal ideal. If 7 is an ideal containing aR, 3 b G R with 7 = bR. Since b divides 
a, the element b is a unit or an associate of a. This means I = R or 7 = aR. 
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Our goal is to prove that a PID is a UFD. Using the two theorems above, it 
only remains to show that a PID is a FD. The proof will not require that ideals be 
principally generated, but only that they be finitely generated. This turns out to 
be equivalent to the property that any collection of ideals has a "maximal" element. 
We shall see below that this is a useful concept which fits naturally into the study of 
unique factorization domains. 

Theorem Suppose R is a commutative ring. Then the following are equivalent. 

1) If I C R is an ideal, 3 a finite set {a±, a-i, ■■■, a n } C R such that I = 
a\R + a2-R + • • • + a n R, i.e., each ideal of R is finitely generated. 

2) Any non-void collection of ideals of R contains an ideal I which is maximal in 
the collection. This means if J is an ideal in the collection with J D I, then 

J = I. (The ideal I is maximal only in the sense described. It need not contain 
all the ideals of the collection, nor need it be a maximal ideal of the ring R.) 

3) If I\ C I2 C I3 C ... is a monotonic sequence of ideals, 3 to > 1 such that I t = h a 
for all t > to- 

Proof Suppose 1) is true and show 3). The ideal / = I\ U I2 U . . . is finitely 
generated and 3 to > 1 such that I to contains those generators. Thus 3) is true. Now 
suppose 2) is true and show 1). Let I be an ideal of R, and consider the collection 
of all finitely generated ideals contained in /. By 2) there is a maximal one, and it 
must be I itself, and thus 1) is true. We now have 2)=>1)=>3), so suppose 2) is false 
and show 3) is false. So there is a collection of ideals of R such that any ideal in the 
collection is properly contained in another ideal of the collection. Thus it is possible 
to construct a sequence of ideals I\ C I2 C I3 . . . with each properly contained in 
the next, and therefore 3) is false. (Actually this construction requires the Hausdorff 
Maximality Principle or some form of the Axiom of Choice, but we slide over that.) 

Definition If R satisfies these properties, R is said to be Noetherian, or it is said 
to satisfy the ascending chain condition. This property is satisfied by many of the 
classical rings in mathematics. Having three definitions makes this property useful 
and easy to use. For example, see the next theorem. 

Theorem A Noetherian domain is a FD. In particular, a PID is a FD. 

Proof Suppose there is a non-zero non-unit element that does not factor as the 
finite product of irreducibles. Consider all ideals dR where d does not factor. Since 
R is Noetherian, 3 a maximal one cR. The element c must be reducible, i.e., c = ab 
where neither a nor 6 is a unit. Each of aR and bR properly contains cR, and so each 



Chapter 6 Appendix 113 

of a and b factors as a finite product of irreducibles. This gives a finite factorization 
of c into irreducibles, which is a contradiction. 

Corollary A PID is a UFD. So Z is a UFD and if F is a field, F[x] is a UFD. 



You see the basic structure of UFD S is quite easy. It takes more work to prove 
the following theorems, which are stated here only for reference. 

Theorem If R is a UFD then R[xi,...,x n ] is a UFD. Thus if F is a field, 
F[xi, ...,x n ] is a UFD. (This theorem goes all the way back to Gauss.) 

If R is a PID, then the formal power series -R[[xi, ...,£„]] is a UFD. Thus if F 
is a field, F[[xi, ...,x n ]] is a UFD. (There is a UFD R where R[[x]] is not a UFD. 
See page 566 of Commutative Algebra by N. Bourbaki.) 

Theorem Germs of analytic functions on C n form a UFD. 

Proof See Theorem 6.6.2 of An Introduction to Complex Analysis in Several Vari- 
ables by L. Hormander. 

Theorem Suppose R is a commutative ring. Then R is Noetherian => R[xi, ...,x n ] 
and i?[[xi, ...,#„]] are Noetherian. (This is the famous Hilbert Basis Theorem.) 

Theorem If R is Noetherian and I C R is a proper ideal, then R/I is Noetherian. 
(This follows immediately from the definition. This and the previous theorem show 
that Noetherian is a ubiquitous property in ring theory.) 



Domains With Non-unique Factorizations Next are presented two of the 
standard examples of Noetherian domains that are not unique factorization domains. 



Exercise Let R = Z(\/5) = {n + my/5 : n,m G Z}. Show that R is a subring of 
R which is not a UFD. In particular 2 • 2 = (1 — y/h) • (— 1 — y/h) are two distinct 
irreducible factorizations of 4. Show R is isomorphic to Z[:r]/(:r 2 — 5), where (x 2 — 5) 
represents the ideal (x 2 — 5)Z[x], and R/(2) is isomorphic to Ta2\x\J(x 2 — [5]) = 
Z 2 [x]/(x 2 + [1]), which is not a domain. 
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Exercise Let R = H[x,y, z]/(x 2 — yz). Show x 2 — yz is irreducible and thus 
prime in R[x, y,z]. If u 6 R[x, y, z], let u € R be the coset containing u. Show R 
is not a UFD. In particular x ■ x = y ■ z are two distinct irreducible factorizations 
of x 2 . Show R/(x) is isomorphic to H[y,z]/(yz), which is not a domain. An easier 
approach is to let / : R[x,y,z] — ► R[x,y] be the ring homomorphism defined by 
f(x) = xy, f(y) = x 2 , and f(z) = y 2 . Then S = H[xy,x 2 ,y 2 ] is the image of 
/ and S is isomorphic to R. Note that xy, x 2 , and y 2 are irreducible in S and 
(xy)(xy) = (x 2 )(y 2 ) are two distinct irreducible factorizations of (xy) 2 in S. 

Exercise In Group Theory If G is an additive abelian group, a subgroup H 
of G is said to be maximal if H ^ G and there are no subgroups properly between 
H and G. Show that H is maximal iff G/H ~ Z p for some prime p. For simplicity, 
consider the case G = Q. Which one of the following is true? 

1) If a £ Q, then there is a maximal subgroup H of Q which contains a. 

2) Q contains no maximal subgroups. 

Splitting Short Exact Sequences 



Suppose B is an i?-module and K is a sub module of B. As defined in the chapter 
on linear algebra, K is a summand of B provided 3 a submodule L of B with 
K + L = B and K C\L = 0. In this case we write K © L = B. When is K a summand 
of Bl It turns out that if is a summand of B iff there is a splitting map from 
B/K to B. In particular, if B/K is free, K must be a summand of B. This is used 
below to show that if R is a PID, then every submodule of R n is free. 

Theorem 1 Suppose R is a ring, B and C are i?-modules, and g : B — > C is a 
surjective homomorphism with kernel if. Then the following are equivalent. 

1) if is a summand of B. 

2) g has a right inverse, i.e., 3 a homomorphism h : C — ► B with goh = I : C ^ C. 
(h is called a splitting map.) 

Proof Suppose 1) is true, i.e., suppose 3 a submodule L of B with K (B L = B. 
Then (g|L) : L — > C is an isomorphism. If i : L — ► £> is inclusion, then h defined 
by h = i o (g\L)~ l is a right inverse of g. Now suppose 2) is true and h : C — ► £> 
is a right inverse of g. Then /j is injective, if + /i(C) = B and if fl fo(C) = 0. 
Thus if © h(C) = B. 
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Definition Suppose / : A — ► B and g : B — ► C are i?-module homomorphisms. 

The statement that 0^^4^5-^C^O is a s/iort exact sequence (s.e.s) means 
/ is injective, g is surjective and f(A) = ker(g). The canonical split s.e.s. is A — > 
^4 © C — > C where f = i\ and g = tt 2 . A short exact sequence is said to split if 3 
an isomorphism 5 -^ A © C such that the following diagram commutes. 



f 9 
0^ A B C -^0 



7T 2 

A®C 

We now restate the previous theorem in this terminology. 

Theorem 1.1 A short exact sequence O^A^B^C^O splits iff f(A) is 
a summand of B, iff B — > C has a splitting map. If C is a free i?-module, there is 
a splitting map and thus the sequence splits. 

Proof We know from the previous theorem f(A) is a summand of B iff B — > C 
has a splitting map. Showing these properties are equivalent to the splitting of the 
sequence is a good exercise in the art of diagram chasing. Now suppose C has a free 
basis T G C, and g : B — ► C is surjective. There exists a function h : T ^ B such 
that g o h(c) = c for each c E T. The function ft, extends to a homomorphism from 
C to B which is a right inverse of g. 

Theorem 2 If R is a domain, then the following are equivalent. 

1) R is a P1D. 

2) Every submodule of i?# is a free i?- module of dimension < 1. 

This theorem restates the ring property of P1D as a module property. Although 
this theorem is transparent, 1)=^2) is a precursor to the following classical result. 

Theorem 3 If R is a PID and A C R n is a submodule, then A is a free i?-module 
of dimension < n. Thus subgroups of Z n are free Z-modules of dimension < n. 

Proof From the previous theorem we know this is true for n = 1. Suppose n > 1 
and the theorem is true for submodules of R n ~ l . Suppose A c R n is a submodule. 
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Consider the following short exact sequences, where / : R n ~ l — y R n ~ 1 (BR is inclusion 
and g = 7T : i? n_1 © R — y R is the projection. 

— ► RJ 1 ' 1 -U i?™" 1 ®R^ R — ► 

— y A n fT" 1 — y 4 — y 7r(A) — > 

By induction, A fl i? n_1 is free of dimension < n — 1. If 7r(A) = 0, then A C i? n_1 . 
If 7r(A) 7^ 0, it is free of dimension 1 and thus the sequence splits by Theorem 1.1. 
In either case, A is a free submodule of dimension < n. 

Exercise Let A C Z 2 be the subgroup generated by {(6,24), (16,64)}. Show A 

x3 

is a free Z-module of dimension 1. Also show the s.e.s. Z4 — ► Z12 — ► Z3 splits 
but Z — y Z — y Z2 and Z2 — ► Z4 — y Z2 do not (see top of page 78). 

Euclidean Domains 



The ring Z possesses the Euclidean algorithm and the polynomial ring F[x] has 
the division algorithm (pages 14 and 45). The concept of Euclidean domain is an 
abstraction of these properties, and the efficiency of this abstraction is displayed in 
this section. Furthermore the first axiom, 4>{a) < 4>(ab), is used only in Theorem 
2, and is sometimes omitted from the definition. Anyway it is possible to just play 
around with matrices and get some deep results. If R is a Euclidean domain and M 
is a finitely generated i?-module, then M is the sum of cyclic modules. This is one of 
the great classical theorems of abstract algebra, and you don't have to worry about 
it becoming obsolete. Here N will denote the set of all non-negative integers, not 
just the set of positive integers. 

Definition A domain R is a Euclidean domain provided 3 cf> : (R — Q) — ► N such 
that if a, b 6 (R — 0), then 

1) 0(a) < 0(o6). 

2) 3 q, r e R such that a = bq + r with r = or 0(r) < (f)(b). 

Examples of Euclidean Domains 

Z with 4>{n) = \n\. 

A field F with (j)(a) = 1 Va^O or with 0(a) = Va/0. 

F[x] where F is a field with 4>{f = a$ + a\X + • • • + a n x n ) = deg(/). 

Z[i] = {a + bi : a, b G Z} = Gaussian integers with 4>{a + bi) = a 2 + b 2 . 
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Theorem 1 If R is a Euclidean domain, then R is a PID and thus a UFD. 

Proof If I is a non-zero ideal, then 3 6 G 7 — satisfying 0(6) < 0(a) V a G 7 — 0. 
Then 6 generates I because if a G I — Q, 3 q,r with a = bq + r. Now r £ I and 
r^0^> 0(r) < 0(6) which is impossible. Thus r = and a EbR so I = bR. 

Theorem 2 If R is a Euclidean domain and a, 6 G i? — 0, then 

0(1) is the smallest integer in the image of 0. 

a is a unit in R iff 0(a) = 0(1). 

a and 6 are associates => 0(a) = 0(6). 

Proof This is a good exercise. However it is unnecessary for Theorem 3 below. 

The following remarkable theorem is the foundation for the results of this section. 

Theorem 3 If R is a Euclidean domain and (ojj) G R n ,t is a non-zero matrix, 
then by elementary row and column operations (ojj) can be transformed to 



/ di 
d 2 



V 



\ 



J 



where each dj ^ 0, and dj|dj + i for 1 < i 
generated by the entries of (a,ij)- 



< m. Also o?! generates the ideal of R 



Proof Let I c R be the ideal generated by the elements of the matrix A = (a^j). 
If E G R n , then the ideal J generated by the elements of EA has J C I. If E is 
invertible, then J = I . In the same manner, if E G Rt is invertible and J is the ideal 
generated by the elements of AE, then J = I. This means that row and column 
operations on A do not change the ideal I. Since R is a PID, there is an element 
d\ with I = diR, and this will turn out to be the d\ displayed in the theorem. 

The matrix (a^) has at least one non-zero element d with 0(d) a miminum. 
However, row and column operations on (a^) may produce elements with smaller 



118 



Appendix Chapter 6 



(j) values. To consolidate this approach, consider matrices obtained from (ay) by a 
finite number of row and column operations. Among these, let (fry) be one which 
has an entry d\ ^ with <f)(d\) a minimum. By elementary operations of type 2, the 
entry d\ may be moved to the (1,1) place in the matrix. Then d\ will divide the other 
entries in the first row, else we could obtain an entry with a smaller value. Thus 
by column operations of type 3, the other entries of the first row may be made zero. 
In a similar manner, by row operations of type 3, the matrix may be changed to the 
following form. 



/ di • • • \ 



V o 



C ij 



Note that d\ divides each Cy, and thus I 
on the size of the matrix. 



diR. The proof now follows by induction 



This is an example of a theorem that is easy to prove playing around at the 
blackboard. Yet it must be a deep theorem because the next two theorems are easy 
consequences. 

Theorem 4 Suppose R is a Euclidean domain, B is a finitely generated free R- 
module and A C B is a non-zero submodule. Then 3 free bases {01,02, ■■■,cit} f° r A 
and {&i, &2> ■■-, b n } for B, with t < n, and such that each Oj = djOj, where each di ^ 0, 
and di|d i+ i for 1 < i < t. Thus B/A « i?/di i?/d 2 © • • • © i?/d 4 © -R n_i . 

Proof By Theorem 3 in the section Splitting Short Exact Sequences, A has a 
free basis {t>i,i>2, •••,«<}• Let {wi,^, ■■■,w n \ be a free basis for B, where n > t. The 
composition 



R f 



A 



B 



W 



('; 



W; 



is represented by a matrix (a^-) G i? n> i where f j = Oi^^i + 02,4^2 + • • • + a n ^w n . By 
the previous theorem, 3 invertible matrixes [/ G i? n and V £ R t such that 
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U(a itj )V 



( d x • • • 
d 2 

: '•• 



V o 



o \ 



d t 



J 



with di\di + \. Since changing the isomorphisms R f ^^ A and B ^^ R n corresponds 
to changing the bases {vi,v 2 , ■■■,Vt} and {101,102, ■■■,w n \, the theorem follows. 

Theorem 5 If R is a Euclidean domain and M is a finitely generated i?-module, 
then M sa R/d x ®R/d 2 ®- ■ -®R/d t ®R m where each rfj 7^ 0, and dj|d i+ i for 1 < % < t. 



Proof By hypothesis 3 a finitely generated free module B and a surjective homo- 



morphism B — ► M 
a s.e.s. and B/A r 



— ► 0. Let ^ be the kernel, so — ► A -±+ B — > M — 
M. The result now follows from the previous theorem. 



is 



The way Theorem 5 is stated, some or all of the elements dj may be units, and for 
such di, R/di = 0. If we assume that no di is a unit, then the elements di, d 2 , ..., d t are 
called invariant factors. They are unique up to associates, but we do not bother with 
that here. If R = Z and we select the di to be positive, they are unique. If R = F[x] 
and we select the di to be monic, then they are unique. The splitting in Theorem 5 
is not the ultimate because the modules R/di may split into the sum of other cyclic 
modules. To prove this we need the following Lemma. 



Lemma Suppose R is a PID and b and c are non-zero non-unit elements of R. 
Suppose b and c are relatively prime, i.e., there is no prime common to their prime 
factorizations. Then bR and cR are comaximal ideals. (See p 108 for comaximal.) 



Proof There exists an o G R with aR 
unit, so R = bR + cR. 



bR + cR. Since a\b and a\c, a is a 



Theorem 6 Suppose R is a PID and d is a non-zero non-unit element of R. 

Assume d = p^p^ 2 • • -pt* is the prime factorization of d (see bottom of p 110). Then 
the natural map R/d -^^R/pl 1 © • • • © R/pP is an isomorphism of i?-modules. 
(The elements pp are called elementary divisors of R/d.) 



Proof If i 7^ j, pi* and Pj 3 are relatively prime. By the Lemma above, they are 
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comaximal and thus by the Chinese Remainder Theorem, the natural map is a ring 
isomorphism (page 108). Since the natural map is also an R- module homomorphism, 
it is an i?-module isomorphism. 

This theorem carries the splitting as far as it can go, as seen by the next exercise. 

Exercise Suppose R is a PID, p G R is a prime element, and s > 1. Then the 
i?-module R/p s has no proper submodule which is a summand. 



Torsion Submodules This will give a little more perspective to this section. 

Definition Suppose Mis a module over a domain R. An element m G M is said 
to be a torsion element if 3 r G R with r / and mr = 0. This is the same as 
saying m is dependent. If R = Z, it is the same as saying m has finite order. Denote 
by T(M) the set of all torsion elements of M. If T(M) = 0, we say that M is torsion 
free. 

Theorem 7 Suppose M is a module over a domain R. Then T(M) is a submodule 
of M and M/T(M) is torsion free. 

Proof This is a simple exercise. 

Theorem 8 Suppose R is a Euclidean domain and M is a finitely generated 
i?-module which is torsion free. Then M is a free R- module, i.e., M ~ R m . 

Proof This follows immediately from Theorem 5. 

Theorem 9 Suppose R is a Euclidean domain and M is a finitely generated 
i?-module. Then the following s.e.s. splits. 



— ► T(M) — > M — ► M/T(M) — ► 

Proof By Theorem 7, M/T(M) is torsion free. By Theorem 8, M/T(M) is a free 
i?-module, and thus there is a splitting map. Of course this theorem is transparent 
anyway, because Theorem 5 gives a splitting of M into a torsion part and a free part. 
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Note It follows from Theorem 9 that 3 a free submodule V of M such that T(M)(B 
V = M. The first summand T(M) is unique, but the complementary summand V is 
not unique. V depends upon the splitting map and is unique only up to isomorphism. 



To complete this section, here are two more theorems that follow from the work 
we have done. 

Theorem 10 Suppose T is a domain and T* is the multiplicative group of units 
of T. If G is a finite subgroup of T*, then G is a cyclic group. Thus if F is a finite 
field, the multiplicative group F* is cyclic. Thus if p is a prime, (Z p )* is cyclic. 

Proof This is a corollary to Theorem 5 with R = Z. The multiplicative group G 
is isomorphic to an additive group Z/di © Z/o?2 © • • • © Z/d t where each di > 1 and 
dj|dj + i for 1 < i < t. Every -u in the additive group has the property that ud t = 0. 
So every g G G is a solution to £ d * — 1 = 0. If t > 1, the equation will have degree 
less than the number of roots, which is impossible. Thus t = 1 and so G is cyclic. 

Exercise For which primes p and g is the group of units (Z p x Z q )* a cyclic group? 

We know from Exercise 2) on page 59 that an invertible matrix over a field is the 
product of elementary matrices. This result also holds for any invertible matrix over 
a Euclidean domain. 

Theorem 11 Suppose R is a Euclidean domain and A G R n is a matrix with 
non-zero determinant. Then by elementary row and column operations, A may be 
transformed to a diagonal matrix 



Mi o \ 

G?2 



where each dj ^ and dj|dj + i for 1 < i < n. Also d\ generates the ideal generated 
by the entries of A. Furthermore A is invertible iff each di is a unit. Thus if A is 
invertible, A is the product of elementary matrices. 
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Proof It follows from Theorem 3 that A may be transformed to a diagonal matrix 
with di\di + \. Since the determinant of A is not zero, it follows that each rf; ^ 0. 
Furthermore, the matrix A is invertible iff the diagonal matrix is invertible, which is 
true iff each di is a unit. If each di is a unit, then the diagonal matrix is the product 
of elementary matrices of type 1. Therefore if A is invertible, it is the product of 
elementary matrices. 

Exercise Let R = Z, A = I n . and D = I . Perform elementary 

operations on A and D to obtain diagonal matrices where the first diagonal element 
divides the second diagonal element. Write D as the product of elementary matri- 
ces. Find the characteristic polynomials of A and D. Find an elementary matrix B 
over Z such that B~ 1 AB is diagonal. Find an invertible matrix C in R2 such that 
C~ l DC is diagonal. Show C cannot be selected in Q2. 

Jordan Blocks 



In this section, we define the two special types of square matrices used in the 
Rational and Jordan canonical forms. Note that the Jordan block B(q) is the sum 
of a scalar matrix and a nilpotent matrix. A Jordan block displays its eigenvalue 
on the diagonal, and is more interesting than the companion matrix C(q). But as 
we shall see later, the Rational canonical form will always exist, while the Jordan 
canonical form will exist iff the characteristic polynomial factors as the product of 
linear polynomials. 

Suppose R is a commutative ring, q = a + a±x + • • • + a n _ix n ~ 1 + x n G R[x] 
is a monic polynomial of degree n > 1, and V is the i?[rc]-module V = R[x]/q. 
V is a torsion module over the ring R[x], but as an i?-module, V has a free basis 
{l,x,x 2 , . . . ,x n ~ 1 }. (See the last part of the last theorem on page 46.) Multipli- 
cation by x defines an i?-module endomorphism on V, and C(q) will be the ma- 
trix of this endomorphism with respect to this basis. Let T : V — ► V be defined 
by T(v) = vx. If h(x) G R[x], h(T) is the i?-module homomorphism given by 
multiplication by h(x). The homomorphism from R[x]/q to R[x]/q given by 
multiplication by h(x), is zero iff h(x) G qR[x]. That is to say q(T) = clqI + a{T+ 
■ ■ ■ + T n is the zero homomorphism, and h(T) is the zero homomorphism iff 
h(x) G qR[x]. All of this is supposed to make the next theorem transparent. 

Theorem Let V have the free basis {l,x,x 2 , ...,a; n ~ 1 }. The companion matrix 
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123 



representing T is 



C(q) 



/O 

1 ... 
1 



\o 








1 —a r 



-a \ 

-a i 
-a 2 



The characteristic polynomial of C(q) is q, and |C(g)| = (— l) n Oo- Finally, if h(x) G 
R[x], h(C(q)) is zero iff h(x) G qR[x]. 

Theorem Suppose A G R and q(x) = (x — X) n . Let V have the free basis 
{1, (x — X),(x — A) 2 , . . . , (x — A) n_1 }. Then the matrix representing T is 



B(q) 



( A \ 

1 A ... 

o i a ; 



V o i x J 



The characteristic polynomial of B(q) is q, and |-B(g)| = X n = (— l) n ao- Finally, if 
h(x) G R[x], h(B(q)) is zero iff h(x) G qR[x]. 

Note For n = 1, C(ao + x) = -B(ao + x) = (— Oo). This is the only case where a 
block matrix may be the zero matrix. 

Note In B(q), if you wish to have the I s above the diagonal, reverse the order of 
the basis for V. 



Jordan Canonical Form 



We are finally ready to prove the Rational and Jordan forms. Using the previous 
sections, all that's left to do is to put the pieces together. (For an overview of Jordan 
form, read first the section in Chapter 5, page 96.) 
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Suppose R is a commutative ring, V is an i?-module, and T : V —> V is an 
i?-module homomorphism. Define a scalar multiplication V x R[x] —> V by 
v(ao + a\X + • • • + a r x r ) = vao + T{v)a\ + • • • + T r (v)a r . 

Theorem 1 Under this scalar multiplication, V is an i?[x]-module. 

This is just an observation, but it is one of the great tricks in mathematics. 
Questions about the transformation T are transferred to questions about the module 
V over the ring R[x\. And in the case R is a field, R[x] is a Euclidean domain and so 
we know almost everything about V as an R[x] -module. 

Now in this section, we suppose R is a field F, V is a finitely generated F-module, 
T : V — > V" is a linear transformation and V is an F[x]-module with wx = T(v). Our 
goal is to select a basis for V such that the matrix representing T is in some simple 
form. A submodule of Vp[ x ] is a submodule of Vp which is invariant under T. We 
know Vf[ x ] is the sum of cyclic modules from Theorems 5 and 6 in the section on 
Euclidean Domains. Since V is finitely generated as an F-module, the free part of 
this decomposition will be zero. In the section on Jordan Blocks, a basis is selected 
for these cyclic modules and the matrix representing T is described. This gives the 
Rational Canonical Form and that is all there is to it. If all the eigenvalues for T are 
in F, we pick another basis for each of the cyclic modules (see the second theorem in 
the section on Jordan Blocks). Then the matrix representing T is called the Jordan 
Canonical Form. Now we say all this again with a little more detail. 



From Theorem 5 in the section on Euclidean Domains, it follows that 

V F[x] « F[x]/d! © F[x]/d 2 • • • ® F[x]/d t 
where each di is a monic polynomial of degree > 1, and di\di+\. Pick {l,x,x 2 , r 



ra—l' 



as the F-basis for F[x]/di where m is the degree of the polynomial di. 
Theorem 2 With respect to this basis, the matrix representing T is 



/ C{d 1 



C{d 2 



C(dt) J 
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The characteristic polynomial of T is p = d\d 2 ■ ■ ■ d t and p(T) 
of canonical form but it does not seem to have a name. 



0. This is a type 



Now we apply Theorem 6 to each F[x\/di. This gives Vp[ x ] ~ F[x]/p S i ® ' ' ' ® 
F[x\/p s r r where the pi are irreducible monic polynomials of degree at least 1. The pi 
need not be distinct. Pick an F-basis for each F[x]/p^ as before. 

Theorem 3 With respect to this basis, the matrix representing T is 



/ C{p{ 1 



C(p s 2 2 



C(p?) J 



The characteristic polynomial of T is p = p^ 1 ■ ■ -p s r r and p(T) = 0. This is called 
the Rational canonical form for T. 

Now suppose the characteristic polynomial of T factors in F[x] as the product of 
linear polynomials. Thus in the Theorem above, p t = x — \ and 



V F[X] « F[x]/{x - A0 S1 



f[x]/{x - x r y 



is an isomorphism of F[x]-modules. Pick {1, (x — X{), (x — Aj) 2 , . . . , (x — A«) m 1 } as 
the F-basis for F[x]/(x — \i) St where m is Sj. 

Theorem 4 With respect to this basis, the matrix representing T is 



/ B^x-x^: 



B((x-\ 2 y*) 



B(( x -\ r yr) J 
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The characteristic polynomial of T is p = (x — \\) Sl • • • (x — X r ) Sr and p(T) = 0. This 
is called the Jordan canonical form for T. Note that the A» need not be distinct. 

Note A diagonal matrix is in Rational canonical form and in Jordan canonical 
form. This is the case where each block is one by one. Of course a diagonal matrix 
is about as canonical as you can get. Note also that if a matrix is in Jordan form, 
its trace is the sum of the eigenvalues and its determinant is the product of the 
eigenvalues. Finally, this section is loosely written, so it is important to use the 
transpose principle to write three other versions of the last two theorems. 



Exercise Suppose F is a field of characteristic and T G F n has trace(T J ) = 
for < i < n. Show T is nilpotent. Let p G F[x] be the characteristic polynomial of 
T. The polynomial p may not factor into linears in F[x], and thus T may have no 
conjugate in F n which is in Jordan form. However this exercise can still be worked 
using Jordan form. This is based on the fact that there exists a field F containing F 
as a subfield, such that p factors into linears in F[x]. This fact is not proved in this 
book, but it is assumed for this exercise. So 3 an invertible matrix U G F n so that 
U~ 1 TU is in Jordan form, and of course, T is nilpotent iff U~ 1 TU is nilpotent. The 
point is that it sufficies to consider the case where T is in Jordan form, and to show 
the diagonal elements are all zero. 

So suppose T is in Jordan form and trace (T*) = for 1 < i < n. Thus trace 
(p(T)) = a$n where Oo is the constant term of p(x). We know p(T) = and thus 
trace (p(T)) = 0, and thus aon = 0. Since the field has characteristic 0, a^ = 
and so is an eigenvalue of T. This means that one block of T is a strictly lower 
triangular matrix. Removing this block leaves a smaller matrix which still satisfies 
the hypothesis, and the result follows by induction on the size of T. This exercise 
illustrates the power and facility of Jordan form. It also has a cute corollary. 

Corollary Suppose F is a field of characteristic 0, n > 1, and (Ai, A2, ••, A n ) G F n 
satisfies \\ + \ l 2 + • • +\ % n = for each 1 < i < n. Then A; = for 1 < i < n. 



Minimal polynomials To conclude this section here are a few comments on the 
minimal polynomial of a linear transformation. This part should be studied only if 
you need it. Suppose V is an n-dimensional vector space over a field F and T : V — ► V 
is a linear transformation. As before we make V a module over F[x] with T(v) = vx. 
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Definition Ann(Vpi x ]) is the set of all h G F[x] which annihilate V , i.e., which 
satisfy Vh = 0. This is a non-zero ideal of F[x] and is thus generated by a unique 
monic polynomial u(x) G F(x), Ann(VF[ x ]) = w^[x]. The polynomial u is called the 
minimal polynomial of T. Note that u(T) = and if h(x) G F[x], h(T) = iff 
h is a multiple of u in F[x\. If p(x) G F[x] is the characteristic polynomial of T, 
p(T) = and thus p is a multiple of u. 

Now we state this again in terms of matrices. Suppose A G F n is a matrix 
representing T. Then -u(A) = and if h(x) G i^x], fr(A) = iff h is a multiple of 
u in F[x]. If p(x) G F[x] is the characteristic polynomial of A, then p(A) = and 
thus p is a multiple of u. The polynomial it is also called the minimal polynomial of 
A. Note that these properties hold for any matrix representing T, and thus similar 
matrices have the same minimal polynomial. If A is given to start with, use the linear 
transformation T : F n — ► F n determined by A to define the polynomial u. 

Now suppose g 6 F[i] is a monic polynomial and C(q) G F n is the compan- 
ion matrix defined in the section Jordan Blocks. Whenever q(x) = (x — A) n , let 
B(q) G F n be the Jordan block matrix also defined in that section. Recall that q is 
the characteristic polynomial and the minimal polynomial of each of these matrices. 
This together with the rational form and the Jordan form will allow us to understand 
the relation of the minimal polynomial to the characteristic polynomial. 

Exercise Suppose Ai G F ni has qi as its characteristic polynomial and its minimal 



polynomial, and A 



Mi o \ 

A 2 



. Find the characteristic polynomial 



V A r J 

and the minimal polynomial of A. 

Exercise Suppose A E F n . 



1) Suppose A is the matrix displayed in Theorem 2 above. Find the characteristic 
and minimal polynomials of A. 

2) Suppose A is the matrix displayed in Theorem 3 above. Find the characteristic 
and minimal polynomials of A. 

3) Suppose A is the matrix displayed in Theorem 4 above. Find the characteristic 
and minimal polynomials of A. 
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4) Suppose A G F. Show A is a root of the characteristic polynomial of A iff A 
is a root of the minimal polynomial of A. Show that if A is a root, its order 
in the characteristic polynomial is at least as large as its order in the minimal 
polynomial. 

5) Suppose F is a field containing F as a subfield. Show that the minimal poly- 
nomial of A G F n is the same as the minimal polynomial of A considered as a 
matrix in F n . (This funny looking exercise is a little delicate.) 



5 


-1 


3 





2 





3 


1 


-1 



6) Let F = R and A = 2 |. Find the characteristic and minimal 

polynomials of A. 

Determinants 



In the chapter on matrices, it is stated without proof that the determinant of the 
product is the product of the determinants (see page 63). The purpose of this section 
is to give a proof of this. We suppose R is a commutative ring, C is an i?-module, 
n > 2, and B 1 , B 2 , . . . , B n is a sequence of i?-modules. 

Definition A map / : Bi © B 2 © • • • © B n — ► C is R- multilinear means that if 
1 < i < n, and bj G Bj for j ^ i, then / |(&i, 62, ... , Bi, . . . , b n ) defines an i?-linear 
map from Bi to C. 

Theorem The set of all i?-multilinear maps is an i?-module. 

Proof From the first exercise in Chapter 5, the set of all functions from B\ © B 2 © 
• • • © B n to C is an i?-module (see page 69). It must be seen that the R- multilinear 
maps form a submodule. It is easy to see that if f\ and f 2 are i?-multilinear, so is 
/1 + f 2 . Also if / is /^-multilinear and r G R, then (fr) is i?-multilinear. 

From here on, suppose Bi = B 2 = ■ ■ ■ = B n = B . 

Definition 

1) / is symmetric means /(&i, . . . , b n ) = /(& T (i), ■ ■ ■ , b T ( n )) for all 
permutations r on {1, 2, ... , n}. 

2) / is skew- symmetric if /(&i, . . . , b n ) = sign(r)/(6 r (i), . . . , 6 T (n)) for all r. 
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3) / is alternating if / (&i, . . . , b n ) = whenever some 6j = bj for i 7^ j. 

Theorem 

i) Each of these three types defines a submodule of the set of all 

i?-multilinear maps, 
ii) Alternating => skew-symmetric, 
iii) If no element of C has order 2, then alternating •<=>■ skew-symmetric. 

Proof Part i) is immediate. To prove ii), assume / is alternating. It sufficies to 
show that f(bi, ...,b n ) = —f(b T m, ...,b T r n )) where r is a transposition. For simplicity 
assume r = (1,2). Then = f(b x + b 2 ,h + b 2 , b 3 , ..., b n ) = f(b 1 ,b 2 ,b 3 ,...,b n ) + 
f(b 2 ,bi,b 3 , ...,b n ) and the result follows. To prove iii), suppose / is skew symmetric 
and no element of C has order 2, and show / is alternating. Suppose for convenience 
that b\ = b 2 and show f(b\, 61, 63, ... , b n ) = 0. If we let r be the transposition (1, 2), 
we get /(&i, 61, b 3 , ... , b n ) = -f(h, b u b 3 , ... , b n ), and so 2/(6i, 61, 63, ... , b n ) = 0, 
and the result follows. 

Now we are ready for determinant. Suppose C = R. In this case multilinear 
maps are usually called multilinear forms. Suppose B is R n with the canonical basis 
{ei, e 2 , . . . , e n }. (We think of a matrix A £ R n as n column vectors, i.e., as an element 
of B © B © • • • © B.) First we recall the definition of determinant. 

Suppose A = (ojj) G R n . Define d : B®B®- ■ -®B — ► i? by d(ai,iei + a2,ie2 + - • • + 
a n ,ie n , , ai, n ei + a 2 ^ n e 2 + • • • + a„ in e n ) = ]T all T sign(r)(a T ( 1 ) il o T (2),2 ■ ■ ■ a T (n),n) = |^|- 

The next theorem follows from the section on determinants on page 61. 

Theorem d is an alternating multilinear form with d(ei, e 2 , . . . , e n ) = 1. 

If c G i?, dc is an alternating multilinear form, because the set of alternating forms 
is an R- module. It turns out that this is all of them, as seen by the following theorem. 



Theorem Suppose f : B (B B (B . . . (B B ^ R is an alternating multilinear form. 
Then / = df(e\, e 2 , . . . , e n ). This means / is the multilinear form d times the scalar 
f(ei,e 2 , ...,e n ). In other words, if A = (a^j) G R n , then /(ai^ei + a2,ie2 + • • • + 
a n ,ie n , , a\, n e 2 + a2, n e 2 H + a nin e n ) = |A|/(ei, e 2 , ..., e n ). Thus the set of alter- 
nating forms is a free R- module of dimension 1, and the determinant is a generator. 
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Proof For n = 2, you can simply write it out. /(oi^ei + ai^e-i, a\, 2 &\ + ^2,262) = 
«i,i«i,2/(ei, ei) + 01,102,2/(61, e 2 ) + a 2A a lj2 f(e 2 , ei) + a 2A a 2t2 f(e 2 , e 2 ) = (01,102,2- 

Oi,202,i)/(ei, e 2 ) = |A|/(ei, e 2 ). For the general case, /(oi,iei + a 2 ,ie 2 H h 

a n ,i e n, , a>i, n ei + a 2 , n e 2 -\ h o n , n e n ) = S 0*1,10*2,2 • ■ -o^n/^, e i2 , ..., e in ) where 

the sum is over all 1 < i\ < n, 1 < ^2 < w, ..., 1 < i n < n. However, if any i s = i t 
for s 7^ t, that term is because / is alternating. Therefore the sum is 

jUSt Sail r Or(l),l a r(2),2 ' ' " O r ( n ), n /(e T (i), e T ( 2 ), ■ ■ • , e T ( n )) = Sail r sign(r)a T (i) ) i 

a r (2),2 • • • a T („),„/(ei, e 2 , . . . , e n ) = |A|/(ei, e 2 , •••, e n ). 

This incredible classification of these alternating forms makes the proof of the 
following theorem easy. (See the third theorem on page 63.) 

Theorem If C, A G R n , then \CA\ = \C\\A\. 

Proof Suppose C G R n . Define / : R n —>■ R by f(A) = \CA\. In the notation of 
the previous theorem, B = R n and R n = R n R n • • • © iT. If A G R n , A = 
(A 1 ,A 2 ,...,A n ) where A,- L G i? n is column i of A, and / : R n © • • • © R n -^ R 
has f(A 1 ,A 2 ,...,A n ) = \CA\. Use the fact that CA = (CA 1 ,CA 2 ,...,CA n ) to 
show that / is an alternating multilinear form. By the previous theorem, f(A) = 
|^4|/(ei, e 2 , ..., e„). Since /(ei, e 2 , ..., e„) = |CJ| = |C|, it follows that \CA\ = f(A) = 
\A\\C\. 

Dual Spaces 



The concept of dual module is basic, not only in algebra, but also in other areas 
such as differential geometry and topology. If V is a finitely generated vector space 
over a field F, its dual V* is defined as V*= Hornp(V, F). V* is isomorphic to V, but 
in general there is no natural isomorphism from V to V*. However there is a natural 
isomorphism from V to V** , and so V* is the dual of V and V may be considered 
to be the dual of V*. This remarkable fact has many expressions in mathematics. 
For example, a tangent plane to a differentiable manifold is a real vector space. The 
union of these spaces is the tangent bundle, while the union of the dual spaces is the 
cotangent bundle. Thus the tangent (cotangent) bundle may be considered to be the 
dual of the cotangent (tangent) bundle. The sections of the tangent bundle are called 
vector fields while the sections of the cotangent bundle are called 1-forms. 

In algebraic topology, homology groups are derived from chain complexes, while 
cohomology groups are derived from the dual chain complexes. The sum of the 
cohomology groups forms a ring, while the sum of the homology groups does not. 
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Thus the concept of dual module has considerable power. We develop here the basic 
theory of dual modules. 

Suppose R is a commutative ring and W is an i?-module. 

Definition If M is an i?-module, let H(M) be the .R-module H(M)=Eom R (M, W). 
If M and TV are i?-modules and g : M — ► N is an i?-module homomorphism, let 
H(g) : H(N) -► H(M) be defined by H(g)(f) = f o g. Note that H(g) is an 
i?-module homomorphism. 



9 



M 



H{g){f) = fog 



N 



f 



W 



Theorem 



i) If Mi and M 2 are i?-modules, H(M 1 M 2 ) « #(Mi) #(M 2 ). 

ii) If / : M -> M is the identity, then #(/) : #(M) -> #(M) is the 
identity. 

h 



iii) If Mi 



Mo 



M3 are i?-module homomorphisms, then H(g)oH{h) 



H (h o g). If / : M3 — ► IT 7 is a homomorphism, then 
(#(<?) o #(/>))(/) = H(hog)(f) = foho g. 




f ohog 



Note In the language of the category theory, if is a contravariant functor from 
the category of i?-modules to itself. 
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Theorem If M and iV are i?-modules and g : M — ► A^ is an isomorphism, then 
H(g) : H(N) -> H(M) is an isomorphism with H(g~ l ) = H(g)~ l . 



Proof 



I H{N) = H(I N ) = H(g o g-i) = Hig- 1 ) o H(g) 
I H(M) = H(I M ) = H{g~ l o g) = H(g) o H{g~ l ) 



Theorem 



i) If g : M — > N is a surjective homomorphism, then H(g) : H(N) — > H(M) 
is injective. 

ii) If g : M —>■ N is an injective homomorphism and g(M) is a summand 
of N, then H(g) : H(N) -> H(M) is surjective. 

iii) If R is a field and g : M —>■ N is a homomorphism, then g is surjective 
(injective) iff H(g) is injective (surjective). 

Proof This is a good exercise. 

For the remainder of this section, suppose W = Rr. In this case H(M) = 
Eom R (M,R) is denoted by H(M) = M* and H(g) is denoted by H(g) = g*. 

Theorem Suppose M has a finite free basis {vi, ...,v n }. Define v* G M* by 

v*(v\ri + • • • + v n r n ) = Ti. Thus v*(vj) = 8 it j. Then v$, . . . , i>* is a free basis for 
M*, called the dual basis. Therefore M* is free and is isomorphic to M. 

( \ 



Proof First consider the case of R n = R n> i, with basis {ei, . . . , e n } where e; 



1; 



V o J 

We know (R n )* fa i? ln , i.e., any homomorphism from R n to R is given by a 1 x n 
matrix. Now R\^ n is free with dual basis {e^, . . . , e* } where e* = (0, . . . , 0, lj, 0, . . . , 0). 
For the general case, let g : R n -^ M be given by g(e,) = V; L . Then g* : M* — ► (i? n )* 
sends w* to e*. Since ^* is an isomorphism, {t^, . . . , v *} is a basis for M*. 

Theorem Suppose M is a free module with a basis {vi, . . . , v m } and A^ is a free 
module with a basis {u>i, . . . ,w n } and g : M ^ N is the homomorphism given by 
A = (ajj) G R n ,m- This means ^(fj) = ciijWi + • • • + a n jW n . Then the matrix of 
g* : N* — » M* with respect to the dual bases, is given by A*. 
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Proof Note that g*(w*) is a homomorphism from M to R. Evaluation on Vj gives 
9*{w*){vj) = (w*og)( Vj ) = w*(g(vj)) = <(oijWi + • • ■ + a nd w n ) = atj. Thus g*(w*) 



a>i,ivl 



and thus g* is represented by A 1 . 



Exercise If U is an i?-module, define (f>u : U* (B U — ► i? by (/>u(f,u) = f(u). 
Show that 0u is i?-bilinear. Suppose g : M — ► N is an i?-module homomorphism, 
f <E N* and v G M. Show that (f)N(f,g(v)) = (f) M (g*(f),v). Now suppose M = 
iV = R n and g : i? n — > R n is represented by a matrix A G i? n . Suppose / G (i? n )* 
and d G i? n . Use the theorem above to show that 4> : (R n )* © R n — > i? has the 
property <f>(f,Av) = (f)(A t f,v). This is with the elements of i? n and (R n )* written as 
column vectors. If the elements of R n are written as column vectors and the elements 
of (R n )* are written as row vectors, the formula is (f>(f,Av) = (f>(fA,v). Of course 
this is just the matrix product fAv. Dual spaces are confusing, and this exercise 
should be worked out completely. 

Definition "Double dual" is a "covariant" functor, i.e., if g : M — ► iV is 

a homomorphism, then g** : M** — ► A^**. For any module M, define a : M —y M** 
by a(m) : M* -^ R is the homomorphism which sends / G M* to /(m) G i?, i.e., 
«(m) is given by evaluation at m. Note that a is a homomorphism. 

Theorem If M and A^ are i?-modules and g : M — ► A^ is a homomorphism, then 
the following diagram is commutative. 



M 



o 



A^ 



n 



M** 



N** 



Proof On M, a is given by a(v) = (/>m(—,v). On N, a(u) 
The proof follows from the equation <f>N{f,g{v)) = 4>M{g*{f),v). 



<Pn(-,u) 



Theorem If M is a free i?-module with a finite basis {v\, . . . ,v n }, then 

a : M —* M** is an isomorphism. 

Proof {a(v i), . . . , a(v n )} is the dual basis of {v*, . . . , f *}, i.e., a(i>j) = (w*)*. 
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Note Suppose R is a field and C is the category of finitely generated vector spaces 
over R. In the language of category theory, a is a natural equivalence between the 
identity functor and the double dual functor. 

Note For finitely generated vector spaces, a is used to identify V and V**. Under 
this identification V* is the dual of V and V is the dual of V*. Also, if {vi, . . . ,v n } 
is a basis for V and {v*, . . . , i>*} its dual basis, then {vi, . . . , v n } is the dual basis 
for {d*,...,<}. 

In general there is no natural way to identify V and V*. However for real inner 
product spaces there is. 

Theorem Let R = R and V be an n- dimensional real inner product space. 
Then j3 : V — > V* given by /3(v) = (v, — ) is an isomorphism. 

Proof /3 is injective and V and V* have the same dimension. 

Note If p is used to identify V with V*, then (j) V : V* © V — >• R is just the dot 
product V © V — ► R. 

Note If {w l7 . . . , v n } is any orthonormal basis for V, {/3(i>i), . . . , /?(f n )} is the dual 
basis of {v\, . . . ,v n }, that is /3(vi) = v*. The isomorphism j3 : V —>■ V* defines an 
inner product on V*, and under this structure, /3 is an isometry. If {vi, . . . ,v n } is 
an orthonormal basis for V, {v*, . . . ,i>*} is an orthonormal basis for V*. Also, if U 
is another n-dimensional IPS and / : V — ► [/ is an isometry, then /* : [/* — >• V* 
is an isometry and the following diagram commutes. 

V V* 



f 



r 



r- P , f -: 



Exercise Suppose R is a commutative ring, T is an infinite index set, and 
for each t E T, R t = R. Show (0i?*)* is isomorphic to R T = Y[R t - Now let 

teT teT 

T = Z+, R = R, and M = 0R t . Show M* is not isomorphic to M. 

teT 
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Abelian group, 20, 71 
Algebraically closed field, 46, 97 
Alternating group, 32 
Ascending chain condition, 112 
Associate elements in a domain, 47, 109 
Automorphism 

of groups, 29 

of modules, 70 

of rings, 43 
Axiom of choice, 10 

Basis or free basis 

canonical or standard for R n , 72, 79 

of a module, 78, 83 
Bijective or one-to-one correspondence^ 
Binary operation, 19 
Boolean algebras, 52 
Boolean rings, 51 

Cancellation law 

in a group, 20 

in a ring, 39 
Cartesian product, 2, 11 
Cayley's theorem, 31 
Cayley-Hamilton theorem, 66, 98, 125 
Center of group, 22 
Change of basis, 83 
Characteristic of a ring, 50 
Characteristic polynomial 

of a homomorphism, 85, 95 

of a matrix, 66 
Chinese remainder theorem, 50, 108 
Classical adjoint of a matrix, 63 



Cofactor of a matrix, 62 

Comaximal ideals, 108, 120 

Commutative ring, 37 

Complex numbers, 1, 40, 46, 47, 97, 104 

Conjugate, 64 

Conjugation by a unit, 44 

Contravariant functor, 131 

Coproduct or sum of modules, 76 

Coset, 24, 42, 74 

Cycle, 32 

Cyclic 

group, 23 

module, 107 

Determinant 

of a homomorphism, 85 

of a matrix, 60, 128 
Diagonal matrix, 56 
Dimension of a free module, 83 
Division algorithm, 45 
Domain 

euclidean, 116 

integral domain, 39 

of a function, 5 

principal ideal, 46 

unique factorization, 111 
Dual basis, 132 
Dual spaces, 130 

Eigenvalues, 95 
Eigenvectors, 95 
Elementary divisors, 119, 120 
Elementary matrices, 58 
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Elementary operations, 57, 122 
Endomorphism of a module, 70 
Equivalence class, 4 
Equivalence relation, 4 
Euclidean algorithm, 14 
Euclidean domain, 116 
Evaluation map, 47, 49 
Even permutation, 32 
Exponential of a matrix, 106 

Factorization domain (FD), 111 

Fermat's little theorem, 50 

Field, 39 

Formal power series, 113 

Fourier series, 100 

Free basis, 72, 78, 79, 83 

Free i?-module, 78 

Function or map, 6 

bijective, 7 

inject ive, 7 

surjective, 7 
Function space Y T 

as a group, 22, 36 

as a module, 69 

as a ring, 44 

as a set, 12 
Fundamental theorem of algebra, 46 

Gauss, 113 

General linear group GL n (R), 55 
Generating sequence in a module, 78 
Generators of Z n , 40 
Geometry of determinant, 90 
Gram-Schmidt orthonormalization, 100 
Graph of a function, 6 
Greatest common divisor, 15 
Group, 19 

abelian, 20 

additive, 20 

cyclic, 23 



multiplicative, 19 
symmetric, 31 

Hausdorff maximality principle, 3, 87, 

109 
Hilbert, 113 

Homogeneous equation, 60 
Homormophism 

of groups, 23 

of rings, 42 

of modules, 69 
Homomorphism of quotient 

group, 29 

module, 74 

ring, 44 

Ideal 

left, 41 

maximal, 109 

of a ring, 41 

prime, 109 

principal, 42, 46 

right, 41 
Idempotent element in a ring, 49, 51 
Image of a function, 7 
Independent sequence in a module, 78 
Index of a subgroup, 25 
Index set, 2 
Induction, 13 

Injective or one-to-one, 7, 79 
Inner product spaces, 98 
Integers mod n, 27, 40 
Integers, 1, 14 
Invariant factors, 119 
Inverse image, 7 

Invertible or non-singular matrix, 55 
Irreducible element, 47, 110 
Isometries of a square, 26, 34 
Isometry, 101 
Isomorphism 
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of groups, 29 
of modules, 70 
of rings, 43 

Jacobian matrix, 91 

Jordan block, 96, 123 

Jordan canonical form, 96, 123, 125 

Kernel, 28, 43, 70 

Least common multiple, 17, 18 
Linear combination, 78 
Linear ordering, 3 
Linear transformation, 85 

Matrix 

elementary, 58 

invertible, 55 

representing a linear transformation, 
84 

triangular, 56 
Maximal 

ideal, 109 

independent sequence, 86, 87 

monotonic subcollection, 4 

subgroup, 114 
Minimal polynomial, 127 
Minor of a matrix, 62 
Module over a ring, 68 
Monomial, 48 

Monotonic collection of sets, 4 
Multilinear forms, 129 
Multiplicative group of a finite field, 121 

Nilpotent 

element, 56 

homomorphism, 93 
Noetherian ring, 112 
Normal subgroup, 26 

Odd permutation, 32 
Onto or surjective, 7, 79 



Order of an element or group, 23 
Orthogonal group 0(n), 102 
Orthogonal vectors, 99 
Orthonormal sequence, 99 

Partial ordering, 3 
Partition of a set, 5 
Permutation, 31 
Pigeonhole principle, 8, 39 
Polynomial ring, 45 
Power set, 12 
Prime 

element, 110 

ideal, 109 

integer, 16 
Principal ideal domain (PID), 46 
Principal ideal, 42 
Product 

of groups, 34, 35 

of modules, 75 

of rings, 49 

of sets, 2, 11 
Projection maps, 11 

Quotient group, 27 
Quotient module, 74 
Quotient ring, 42 

Range of a function, 6 
Rank of a matrix, 59, 89 
Rational canonical form, 107, 125 
Relation, 3 
Relatively prime 

integers, 16 

elements in a PID, 119 
Right and left inverses of functions, 10 
Ring, 38 

Root of a polynomial, 46 
Row echelon form, 59 

Scalar matrix, 57 
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Scalar multiplication, 21, 38, 54, 71 

Self adjoint, 103, 105 

Short exact sequence, 115 

Sign of a permutation, 60 

Similar matrices, 64 

Solutions of equations, 9, 59, 81 

Splitting map, 114 

Standard basis for R n , 72, 79 

Strips (horizontal and vertical), 8 

Subgroup, 14, 21 

Submodule, 69 

Subring, 41 

Summand of a module, 77, 115 

Surjective or onto, 7, 79 

Symmetric groups, 31 

Symmetric matrix, 103 

Torsion element of a module, 121 
Trace 

of a homormophism, 85 

of a matrix, 65 
Transpose of a matrix, 56, 103, 132 
Transposition, 32 

Unique factorization, 

in principal ideal domains, 113 

of integers, 16 
Unique factorization domain (UFD), 111 
Unit in a ring, 38 

Vector space, 67, 85 

Volume preserving homomorphism, 90 

Zero divisor in a ring, 39 



